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Preface 


This book deals primarily with the sample function behaviour of Gaussian, 
and related, random fields; i.e. stochastic processes whose arguments vary in a 
continuous fashion over some subset of 4, N-dimensional Euclidean space. 
The problems that arise in describing this behaviour in the multiparameter 
setting are qualitatively different to those covered by the one-dimensional 
theory. Indeed, most of the really interesting problems are of a geometrical 
nature, and actually disappear in the simple case. The purpose of this book is to 
collect within one cover most of the contents of the substantial literature 
devoted to the sample function analysis of random fields, including a reasonably 
full and self-contained account of the geometry needed for its understanding. 

While working on this book I tried to write with two readers in mind. One 
was a mathematician, for whom it was necessary to present what I feel is a 
rather beautiful theory, incorporating the standards of rigour and exactness 
that mathematicians are accustomed to. The other reader was an applied 
scientist, working in one of the many subjects that employs random fields as an 
element of mathematical modelling. His needs are quite different, for he 
requires a full statement of useful results, presented in such a way that he is not 
required to follow long mathematical proofs in order to understand and use 
them. It is possible that by attempting to please both readers I will have satisfied 
neither. Nevertheless, I beg the patience of both readers when they come upon 
parts of the text obviously written for the other. 

To be able to read the book from cover to cover a reader will need a good 
working knowledge (at graduate student level) of the basic features of modern 
probability theory and, for Chapter 8, measure theory. Although a brief review 
of those aspects of probability theory required is given in Chapter 1, it is 
unreasonable to assume that this can serve as anything other than a convenient 
reference for a reader already familiar with this material. However, with the 
applied scientist in mind, I have attempted to present the main results of the 
book in such a fashion that they can be understood and used by a reader whose 
knowledge of probability incorporates little more than the notions of expec- 
tation and stochastic convergence, 
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The material in this book breaks up into three main parts. Chapters 1 to 3 
cover essentially introductory and preparatory material. Chapter 1 contains 
an elementary discussion of random fields and some probability. Chapter 2 is 
devoted to the spectral representation of homogeneous fields. This material 
generally differs only in minor notational aspects from the spectral theory of 
stationary processes on the real line, and so very few proofs are given. Later in 
the book a variety of regularity conditions needs to be imposed on the sample 
functions of fields for certain results to be meaningful, and Chapter 3 is devoted 
to examining when these regularity conditions are satisfied. Also included in 
Chapter 3 is Garsia’s development of the Karhunen-Loève expansion for 
Gaussian fields. 

Chapters 4 to 7 deal with random fields possessing sample functions which 
are ‘smooth’ (continuous, differentiable, etc.). Chapter 4 is devoted to a 
development of the (integral) geometry and (differential) topology required to 
adequately study these fields. This is applied in Chapter 5 to develop a ‘level 
crossing’ theory for Gaussian fields. Chapter 6 looks at the important problem 
of the behaviour of Gaussian fields above very high levels. This is a subject 
which not only has substantial theoretical interest, but which is also extremely 
important in a wide range of applications. Chapter 7 repeats some of the 
analysis of the preceding two chapters for certain non-Gaussian fields on the 
plane. 

Chapter 8 looks at the third main topic: Gaussian fields whose sample 
functions are not smooth. Because of dichotomy results, which, briefly, state 
that Gaussian sample functions are either very smooth or extremely erratic, in 
this situation none of the analysis used in the smooth cases of the previous 
chapters is at all meaningful. To study sample function behaviour in this 
context the notion of Hausdorff dimension is introduced. A brief treatment of 
local time is also included. 

The book concludes with an appendix which gives an extremely brief treat- 
ment of Markov-type properties for random fields. Although this is virtually 
unrelated to everything else in the book, somehow the book seemed incomplete 
without some mention of this topic, so I included it. It is certainly a topic that 
often gives rise to questions from applied scientists, and is covered in a literature 
that is generally beyond their ken. 

Finally, I have the pleasure of acknowledging some of my debts. The contents 
of this book represent what I have learnt during the past seven years. I was 
exceedingly fortunate during that time to have been able to learn from some 
excellent teachers, sometimes in a formal fashion, sometimes just by watching 
and listening. Among those to whom I am most indebted are Chris Heyde, Joe 
Gani, and David Kendall. I am also extremely grateful to Ross Leadbetter for 
an invitation to visit Chapel Hill in the spring of 1979, during which time I had 
an opportunity to talk incessantly about random fields with many new-found 
friends. I am grateful to all those authors who provided me with copies of 


PREFACE xi 


unpublished work, and, in particular, to Don Geman and Joe Horowitz, 
without whose review paper on occupation densities Chapter 8 would have been 
almost impossible to write. Above all, I owe an enormous debt of gratitude to 
my teacher, Michael Hasofer, not only for originally introducing me to the 
subject of random fields but also for constant and consistent encouragement 
and assistance throughout the past seven years. 

During the period I worked on this book I was supported by a Queen 
Elizabeth II Fellowship at the University of New South Wales, and I am most 
its award. 

Without the support of my wife, Joan, and the tolerance of my daughters, 
Arielle and Tamara, I could never have finished this book. Without my parents' 
earlier support I could never have started it. I also owe a very special debt of 
gratitude to Mrs. Helen Langley. Not only did she produce excellent typescripts 
of the innumerable final drafts of the book, but had it not been for her constant 
demands of ‘When will this ..... thing be finished?' it would undoubtedly 
have been another two years in the writing. 


March 1980 R.J.A. 


CHAPTER 1 


Random Fields and Excursion Sets 


11 INTRODUCTION 


The study of random fields is, by definition, the study of random functions 
defined over some Euclidean space. Consequently, this study can cover an 
extremely wide area, since any question that can be asked about an ordinary 
non-random function, or class of functions, can just as readily be asked about 
their random counterparts. Hence, the general theory of random fields is certainly 
at least as large as the general theory of functions, and this theory, as any student 
of modern mathematics is well aware, is voluminous. Indeed, adding a random 
component to the theory of functions makes it a much larger, more interesting, 
and often more complex subject. Thus it is clearly necessary in any treatment 
of random fields to restrict one's interests to reasonably specific areas within 
the general theory. The areas we shall choose to concern ourselves with will 
be those most immediately suggested by applications. That is not to say that 
applications of the theory we shall develop are of primary or central concern to 
us. Nevertheless, it is a broad truism in the study of stochastic processes that the 
processes which are developed to provide mathematical models of real-life 
phenomena generally are not only the more useful but also generate the most 
interesting mathematics. 

Thus, let us commence by looking at two examples of random fields, and see 
what type of questions they prompt us to ask. Consider firstly a flat piece of 
metal that has been ground to a fine degree of smoothness. Running a finger 
over such a surface one would be surprised to learn the fact, well known to 
metallurgists, that on a microscopic scale such supposedly ‘smooth’ surfaces 
are in fact extremely rough, being composed of a multitude of alternating ‘hills’ 
and ‘valleys’ of various dimensions. Imagine two such nominally flat surfaces 
in contact, subject to various forces, and try, furthermore, to imagine attempting 
to solve a variety of engineering problems related to the friction, resultant force, 
etc., generated by such contact. The first fact that must be taken into considera- 
tion is that because of the microscopically rough nature of the two surfaces in 
contact, actual contact between the two surfaces only occurs when the "hills 
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of one surface adjoin the ‘hills’ of the other, so that most of the engineering 
problems of substance are related to these hills. Hence we must first ask how 
many such hills there are, how high they are, what is their size, and so forth. 
Unfortunately, even for a given pair of surfaces, these questions can be answered 
neither from theoretical nor experimental investigation, and we are forced to 
turn to mathematical modelling. 

To do this we parameterize some hypothetical 'zero plane' that passes 
through such a surface by the points (t;, t2), and let X(t,, t2) denote the per- 
pendicular distance from the point (t4, t,) to the actual surface. That is, X(t,, t2) 
represents the height of the surface above some hypothetical zero level. To 
allow for the fact that we do not know the exact form of the two-parameter 
function X, we allow it to be random in some sense, and thus generate what we 
shall call a two-dimensional random field. The questions we have asked above 
now need to be put in a slightly different form: viz. what are the distributional 
(statistical) properties of the hills of the random field and, given these, how can 
they be used to answer the original engineering problems? Of course, to provide 
answers to such questions it is necessary to carefully formulate exactly what 
type of randomness can be associated with the field X(t,, t,), a decision generally 
taken to yield models which on the one hand are consistent with known proper- 
ties of real surfaces, and on the other hand are mathematically tractable. 

In general, problems related to ‘hills’ and ‘valleys’ are extremely important 
in most applications of random fields. Hence we shall now formalize these 
concepts by introducing the notation of an excursion set, a concept that will 
be of central importance throughout this book. We define the concept first for 
arbitrary, non-random functions F(t) from the N-dimensional Euclidean space 
A` to the real line 2+. 


Definition 1.1.1 


Let F(t): 4" — 2?! be an arbitrary function. Then for any fixed real number u 
and any subset S of A“ we define the excursion set of the function F above the level 
u in S to be the set 


(1.1.1) AF, S) = {t e S: F(t) > u}. 


When there is no likelihood of confusion about which set or function is 
being considered, we shall write this simply as A,, or even A. 

Our primary interest will lie in situations in which the underlying function F 
is in fact random, in which case the excursion sets A,(F, S) and their character- 
istics will also be random. Although it is not clear at this stage that all the 
questions we have asked about hills on rough surfaces are related to excursion 
sets, it is certainly clear that if the only hills of importance turn out to be those 
above some predetermined level u, then these hills lie on the excursion set Au. 
so that these sets must have some relevance. 
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There is in fact a wide variety of questions that one can ask about excursion 
sets, and in Section 1.7 we shall give answers to some of the simpler ones. For 
example, we shall consider questions related to the content of an excursion set, 
as well as the size of its boundary, and also see how simple questions lead to the 
far more complex and intriguing ones that are the central concern of this book. 

For a second example of a random field we shall take a more familiar object, 
an ocean surface. Again, let us parametrize some hypothetical zero plane 
passing through an ocean surface by the points (t,, t2) and let X(t,, t2, t3) 
denote the height of the ocean above the point (f,, t2) at time t4. Again it is 
reasonable to model X as being random in some sense, and so we have a three- 
dimensional random field. If we hold time, t4, fixed, we obtain again a two- 
dimensional field, about which we could ask questions similar to those asked of 
the rough surface model of metals. Thus, once again, we face problems related 
to excursion sets lying in the plane. However, in this example, we have a two- 
dimensional surface that changes with time, so that for a large variety of applica- 
tions it is more natural and useful to consider the three-dimensional excursion 
sets A, = {t = (ti, t5, t3): X(t) > u}. The geometry of these sets is naturally 
quite complex, and in order to study it properly we shall need to delve into the 
related subjects of integral geometry and differential topology. These problems 
will be tackled in Chapter 4. 

In a certain sense, a model of a sea surface immediately gives ríse to intriguing 
questions that do not automatically arise for the metal surface model. For 
example, we can ask if it is reasonable to assume that the random surface 
X(t,, t5, t4) is continuous in its three parameters. If it is continuous, can we also 
safely assume it is differentiable as well? Certainly observation of the roughness 
of a real ocean surface would tend to make us think carefully before making 
such assumptions. It turns out that in the case of ocean surface models these 
assumptions can in fact be justified, but this is not always the case. Thus we are 
led, in Chapter 3, to investigate what sort of preconditions we need to place 
on our mathematical models to ensure that conditions such as continuity 
and differentiability hold, while in Chapter 8 we shall investigate what happens 
when such conditions are violated. 

Random surfaces that are continuous but not differentiable seem, from the 
applied viewpoint, to be in that pathological area of the theory of functions that 
is the sole playground of pure mathematicians. That this is not completely true, 
and that such surfaces have a very useful and important role to play in the 
mathematical modelling of real-world phenomena, has been very ably argued 
in Mandelbrot (1977). There it is shown that these seemingly freakish functions 
arerelevant to modelling in such diverse areas as geography, biology, and turbu- 
lence. 

Non-differentiable, continuous surfaces are generally so rough and erratic 
that the exeursion sets generated by these surfaces tend to be composed of an 
uncountably infinite number of components. Nevertheless, it turns out, 
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perhaps somewhat surprisingly, that\a large number of useful, answerable, 
questions about such surfaces are intimately tied up with these sets. 

This is all we shall have to say about applications of random fields, since 
deeper discussion of a specific application almost always requires a detailed 
knowledge of the area of application, and this is not the place to develop such 
knowledge. The reader interested in applications may like to look at some of the 
following papers, which, although by no means exhaustive, cover a broad 
spectrum of disciplines. The serious study of metallic surfaces via random 
field models seems to have begun with Greenwood and Williamson (1966) and 
Whitehouse and Archard (1970). More recent studies include those of Nayak 
(1971, 1973a, 1973b) and Nuri and Halling (1976), as well as O’Callaghan and 
Cameron (1976). Random field models of ocean surfaces date back to the now- 
classical work of Longuet- Higgins (1952, 1957) and Cartwright and Longuet- 
Higgins (1956). Full reviews of this well-developed area are given in monographs 
by Kinsman (1965) and Sylvester (1974). Random fields have been looked at, 
from an electrical engineering viewpoint, by Wong and Tsui (1977) and Wong 
(1974, 1978). They have been applied to problems in forestry by Matérn (1960), 
geomorphology by Mandelbrot (1975a), geology by Harbaugh and Preston 
(1968), turbulence by Batchelor (1953) and Mandelbrot (1975b), and seismology 
by Robinson (1967). Random fields, generally of a quite different nature to those 
we shall be most concerned with, have also found applications in various 
branches of theoretical physics. In image analysis they have been used by 
numerous authors, including Wong (1968), Ahuja (1977), Panda (1977a, 1977b) 
and Schachter (19792, 1979b). 

Before we commence our own study of random fields we shall take time out 
for a brief review of the probability theory we shall require. In Section 1.7 we 
shall return to random fields to take a look at some of the more simple problems 
associated with them. 
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The following five sections contain a brief excursion through the mathe- 
matical foundations of probability and random fields. Since they have been 
included solely for the sake of completeness, and contain quite standard results, 
proofs are not given. A reader who either wishes to proceed at a more leisurely 
pace, or finds himself confronted by unfamiliar material, would be well advised 
to turn to a more complete treatment elsewhere. Good basic treatments are 
included in Cramér and Leadbetter (1967), Wong (1971), or Gihman and 
Skorohod (1974). 

We start, as is usual, with a measure space (Q, FZ, P): Q is a set with generic 
element w, ¥ is a o field of subsets of Q, and P is a probability measure on F 
satisfying the basic axioms 

0< P(A) - 1, P(Q) l, 


PAO B) PCY) POR) whenever AG RB dh, 
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where A, Be F and $ denotes the empty set. The elements of F will generally 
be referred to as events. 

Let 2/4 ,d > l,denote Euclidean d space; i.e. the set ofd-tuplesx = (x,,..., Xa) 
of real numbers equipped with the standard Euclidean norm 


Ixl = Q2 4 + xg). 


We shall always take x to be a row vector, with x’ its transpose. Define a 
half-open interval in 47 to be a set of the form 


I = (as bi] x +++ x (aa, bal, 


and write Z° to denote the Borel c algebra generated by the half-open intervals 
in 2°. Now let X bea measurable mapping from (Q, 4) into (2°, 22^), so that for 
every Borel set B e Z’ the set 


X^ (B) = (o € Q: X(v) € B} 


is an element of F. Then X is called an &4-valued random variable, or, more 
simply, variate. For each œ, X(o) is a vector of length d. We shall often write its 
coordinates as X,(),..., Xœ). Since each X; is a measurable mapping from 
(Q, F) to (#', 44.) it is also a well-defined random variable. When d = 1, X 
is said to be real valued, and when d = 2 we can, if we so desire, consider X to be 
complex valued with obvious conventions. 

The measurablility of X induces a probability measure Fx, or simply F, on 
B’, defined by 


Fx(B) = P{X~ (B) 
for every B e 2°. To this measure there corresponds a point function which we 
also denote by F, defined by 
F(x) = F(= œ, x1] x +++ x (7-09, x]. 


This function is called the distribution function of the random variable X. 
Using A,, or simply A, to denote Lebesgue measure on 44 we have that if F is 
absolutely continuous there exists a non-negative Borel function fy: 4^ > 4e 
for which 


Fx(B) = [sco dA(x) 


for any B € 4^. This function is known as the probability density function of the 
random variable X. 

If X is an Z^-valued random variable and g: #4 > 2" is a Borel function (so 
that g^ (B) e B for every B € 4"), then Y = g(X) is a measurable mapping 
from (Q, F) to (Z", 2"), and so is an Z"-valued random variable. Its cor- 
responding measure Fy is defined by 


Fy(B) = Fyig (Bj = PIX '(g '(B))} 
for Bu 4". 
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Since the notation 
P{w@: X(w) € B} 
is unduly clumsy, we shall generally use the simpler, but less exact, notation 
P{X € B} 
instead. 
As the interesting properties of random variables are probabilistic we do not 


differentiate between two variables which are equal with probability one. 
Hence, if 


P{X = Y} = 1, 


we call X and Y equivalent and consider them to be indistinguishable. 

Consider now a sequence of real-valued random variables {X,,}. For each 
w € Q the sequence ( X,(o)) is simply a set of real numbers, so that the inferior 
and superior limits 

lim inf X,(@) = lim inf X,(@) 


n- oo no kzn 
and 


lim sup X,(@) = lim sup X,(o) 


no no kzn 


always exist. Suppose these limits are finite for all œ outside some set N c Q 
for which P{N} = 0. Then we can define the random variables 


X, — lim inf X,, X* = lim sup X, 
by setting X,() = lim inf X,(@), X*(@) = lim sup X,(o) for c£ N, and 
choosing any value we like for X,(w) and X*(q) if œ € N. If the inferior and 
superior limits of a sequence {X,,} of random variables agree, we say the sequence 
converges almost surely (a.s.) or with probability one, and set 


X = lim X, = lim inf X, = lim sup X,. 


We write this as 


X, 3X o X, X as.- 


Let us now note that any #*-valued random variable can in fact be regarded 
as a collection of d real-valued variables, by considering the d coordinates of X 
separately. When considering X in this light the function F(x) is called the joint 
distribution function of these variates, and questions of dependence, independence, 
and conditional probability arise. We call two 4" and .#@"-valued random 
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variables X and Y, defined on the same probability space (Q, ¥, P) independent 
if 
P{X € B4, Y¢B,} = P{X € B,)j PYe Bj) 

for all B, € 2", B, e Z". The conditional probability that X € B,, given Y € B, is 
defined by 
P(XeB, Ye B,} 

P{Y € B} 
for all B, € Z" and B, € Z" for which P{X € B,} > 0. This leads naturally to 


the concept of a conditional distribution function for X given Y, defined, in 
measure form, by 


P{X € B,|XeB,} = 


Ja. dF x, v (X. y) 
I B AF y(y) 
where F x y, is the distribution function for the @"*™-valued variate (X, Y). 


If all the distribution functions are absolutely continuous, we can define the 
conditional density for X given Y as 


Fxiy(B4|B;) = ls. 


fix,x(% Y) 
RO) C 


Finally, note that we call two events (elements of F) A and B independent 
events if 


fxiv (X, p= 


P{A n B} = P{A}P{B}, 
and define the conditional probability of A given B by 
P{A ^ B} 

P{B} 


whenever P{B} > 0. It is left undefined, or given an arbitrary value, when 
P{B} = 0. 


P{A|B} = 


1.3 EXPECTATION 


If X is an Z^-valued random variable and g: #4 — #" a Borel function, then 
the expectation of g(X) is defined as the Lebesgue-Stieltjes integral 


E(g(X)) = faxo dP(o), 


provided the integral exists. Because of measurability, this is equal to the Stieltjes 
integral given by 


Eig(X)j = f g(x) dFx(x). 
PI 
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Since g(X) = (g,(X),..., Gn(X)) is Z" valued, so is E{g(X)}. Noting this, it is 
easy to see that E{g(X)} exists if, for i = 1,...,m, 


f gi (X) dF(x) < oo and f gi (X) dF(x) < oo, 
gå gå 


where 


y* = max(0, y), y~ = min(0, y). 

If X is a real-valued random variable and E{|X|} < oo, X is said to be 
integrable. If E{|X|?} < oo, we call X square integrable. The simple expecta- 
tion uy = E{X} is known as the mean of X, and, if X is real valued, we call the 
quantity o2 = E{|X|?} — u4 its variance, while o is known as the standard 
deviation. The product expectation o?y = E((X — uyXY — uy)} involving 
two variables X and Y is known as their covariance, while pyy = o%y/(oy 6y) is 
called their correlation. 

For the remainder of this section let us assume that X is a real-valued random 
variable. The following results on sequences of integrable variables will be of 
considerable importance to us in the future. All are counterparts of standard 
results on sequences of integrable functions in integration theory. 


Lemma 1.3.1 (Monotone convergence) 


Let (X,) be a sequence of non-negative random variables for which X,(@) < 
Xn+1(@) for all n and œ, and suppose there exists a random variable X such that 
X, > X as. Then 


E{X} = lim E{X,}. 
Lemma 1.3.2 (Fatou's lemma) 


Let X, bea sequence of random variables, and suppose there exists an integrable 
random variable X such that X,(@) > X(@) for all n and o. Then 


lim inf E(X,) > 2L inf x, 


n- oo na 


Lemma 1.3.3 (Dominated convergence) 


Let {X,,} bea sequence of random variables and X suchthat X, > X as. Suppose 
that there exists an integrable random variable Y for which |X ,(w)| < Y(o) for 
all n and w. Then 

lim E{X,} = EIX. 


n'a 
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Lemma 1.3.4  (Hólder's inequality) 


Let X and Y berandom variables, and p,q such that p > 1,q > 1,1/p + 1/q — l. 
Then 


E(XYI) < [E(X P} "DEC X19)" 


When p = q = 2 this is known as the Cauchy-Schwartz inequality. A simple 
consequence of the Holder inequality is the following lemma. 


Lemma 1.3.5 


Let X,,..., Xy berandom variables. Then 


EUX, vXVdS TIlE xt. 


Lemma 1.3.6 (Jensen’s inequality) 


Let X be a random variable and V a convex function on (— oo, oo). That is, for 
any À e [0, 1] and u, v E (— œ, ©), u < v, 


Wu + Aw — u)) < A[V (v) — v2]. 
Then if both E{|X |} and E{|Y(X)|} are finite 
WE{X}) € EQU(OX)]. 


The expectation E(|X |") is called the mth-order moment of X. Moments are 
often quite useful in obtaining probabilistic bounds on random variables. The 
following inequality is known as Markov's inequality and holds for any € > 0 
and any m > 0: 


(1.3.1) P{|X| > £} «c"E(XpP. 
When m = 2 this result specializes to give Tchebychev’s inequality 
(1.3.2) P{|X — ux| > £y} € € 7. 

An inequality of a similar nature, but providing lower bounds is 


(1 — ey'ui 
c? ; 


(1.3.3) P{X > suy) > 


For a proof of this result see, for example, Kahane (1968, p. 6). 
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1.4 STOCHASTIC CONVERGENCE 


There are four basic modes of convergence for sequences (X,) of random 
variables. We have already met almost sure convergence. For convenience we 
shall define it once again with the other three. 


Definition 1.4.1 


(a) X,, is said to converge to X with probability one, or almost surely, if there 
exists a set N c Q such that P{N} = 0 and, for every o ¢ N, 


lim ||X,(@) — X(@)|| = 0. 


(b) X,, is said to converge to X in vth mean (v > 0) if 

E(|X, — XI" — 0 as n > oo. 
(c) X, is said to converge to X in probability if, for every € > 0, 

P(|X, — XI > 2) ^0 as n — oo. 
(d) X, is said to converge to X weakly, or in distribution, if 
Fx, (x) > Fx(x) as n> oo 
at every continuity point X of F x(x). 
These four modes of convergence are denoted symbolically by 
X,55X, X,5X  x,5X X 5x 


respectively. Convergence in vth mean when v — 2 is of particular importance, 
and is known as mean square convergence. It is denoted by X, => 

In each of the first three cases there is a corresponding mode of mutual 
convergence of a sequence (X,) which is often useful. A sequence (X,] is said to 
converge mutually almost surely, mutually in vth mean, or mutually in proba- 
bility, if (1.4.1), (1.4.2), or (1.4.3), respectively, holds: 


(1.4.1) sup |X, — X,| **0 — asn oo, 
mèn 
(1.4.2) sup P(|X,, — X,ll > ¢} > 0 as n > oo, Ve > 0, 
mzn 
(1.4.2) sup E{||X,, — X,||"} ^ 0 as n oo. 
mzn 


In each case the mutual convergence of a sequence of random variables in a 
particular mode implies the existence of a limit random variable to which the 
sequence converges in that mode, and vice versa. Since mean square convergence 
will be of particular importance to us in the following chapter, we shall introduce 
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some special notation for it. If {X,,} is a sequence of random variables converging 
mutually in mean square, we shall denote its limit, X, by 


(1.4.4) X = Lim. X, 
n- oo 
where l.i.m. is read limit in mean. 

In the following lemma we summarize a number of results relating the 
various modes of convergence and giving sufficient conditions for convergence. 
Firstly, however, let us introduce two new concepts. À random variable X is 
said to be degenerate at X if P(X = x} = 1. The characteristic function $y(t) of 
an #4-valued random variable X is defined by 


x(t) = ae J 
= [oix sharo 


The distribution ofa random variable is completely determined by its character- 
istic function, and vice versa. Indeed, we can ‘invert’ this defining relationship 
as follows. Let B = [a,, b] x -+ x [a,, ba] be an interval in # such that the 
distribution function 
F(x) = lim F(x,,..., xj) 

Xk 

k*j 
of X ;is continuous at a; and b;, for each j = 1,..., d. Then, writing F in measure 
form, we obtain the inversion formula 


—iajtj .— e * jt 
(1.4.5)  Fx(B) = lim Us asl -fù (= 7 CL) exo dt. 


This, of course, uniquely determines F over most intervals of interest. For a 
proof see, for example, Lukacs and Laha (1964). 


Lemma 1 4.1 
Let (X,) be a sequence of random variables: 


(a) If X,*5 X then X, 5 X. 

(b IfX,>XthenX, 5 X. 

(c) If X, 5 X then there is a subsequence (X,,) of (X,) such that X,, *5 X as 
k> oo. 

(d) If X, > X then E{||X,|"} > Et|X|'3 as n > œ for all r < v. 

(e) If X, 5 X and X is degenerate, then X,, 5 X. 

(f) X, > X if. and only if, dx (€) > y(t) for every t. 
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(g) (Continuity theorem) If $x,(t) > (t) as n — oo for all tand $(t) is 
continuous at t = 0, then X, 2 X where X is the unique random variable 
with the characteristic function ©. 

(h) {X,,} converges almost surely if either (1) or (ii) holds: 


(i) Ya SUPmen PIX, — X,l) = £} < oo for every e > 0. 
(ii) n=1 P(IX, 4, — Xl > 6,} < oo where 6, > Oand Y £; Ôn < oo. 


Further results, relating convergence of means to almost sure convergence, 
were given in Lemmas 1.3.1 and 1.3.3. Results about almost sure convergence 
are invariably based on the Borel-Cantelli lemma. To state this we need to 
define the inferior and superior limits of a sequence of sets A, as 


A* = lim sup A, = N UA, 


noo n=1k2n 


A, = liminf 4, = (J ( A. 


noo n=1k2n 


If A,, A2,..., are measurable sets (events) in (Q, F, P), then A* and A, are also 
measurable and 


P {lim inf A,} x lim inf P{A,} x lim sup P(A,) < P(lim sup A,}. 


Furthermore, P{A*} is the P measure of the set of œ € Q which belongs to 
infinitely many ofthe A, or, in probabilistic terminology, the probability that an 
infinite number of the events A, will occur. 


Lemma 1.4.2 (The Borel-Cantelli lemma) 
Let (A, be a sequence of events. 


(a) If}, P(A,) < oo, then P(lim sup A,} = 0. 
(b) If Y. P{A,} = oo and, moreover, the events A, are independent, then 
P(lim sup A,} = 1. 


We conclude this section with a statement of the Kolmogorov zero-one law. 
Let {X,} be a sequence of random variables defined on (Q, F, P). For any X, 
set o(X,) to be the o-field of all sets of the form (c: X,(@) e B), B e 4^, and 
define o(X,, X,.1,---) similarly. Then a set Ee F will be called a tail event 
if E € o({X,, X,4,,--.) for every n. The collection of all tail events is called the 
tail o field of the sequence {X,}. 


Lemma 1.4.3 (Kolmogorov’s zero one law) 


Let {X,,} be a sequence of independent random variables. Then if E is a tail event, 
PUE is either zero or one. 
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1.5 RANDOM FIELDS 


There are two virtually distinct approaches to defining random fields. One is 
essentially a measure theoretic approach and leads ultimately to a probabilistic 
setting, while the other starts probabilistically and can be eventually placed ina 
measure-theoretic setting. Let us consider the measure-theoretic approach first. 
Let G^? denote the set of all @4-valued functions on 4", N,d > 1,and 9“ the 
c-field containing all sets of the form {g € G"^: g(t) e Bj, j = 1,..., m) where 
m is an arbitrary integer, the t; are points of 2", and the B; are half-open inter- 
vals in 44. Then, much as we defined random variables, we define an (N, d), 
or N-dimensional, random field, or stochastic process,to be a measurable mapping 
X from (Q, 7) into (G"*7, 4-4), We use the notation X(t, œ) to denote the value 
the function in G":*^ corresponding to c takes at the point t. When convenient, 
we shall suppress either the t, the œ, or both. 

Given the existence of the probability measure P on ¥ we can immediately 
obtain from this definition a collection of measures F., on B defined by 


(1.5.1) F,,.... 1B] = PUX(),..., X() € Bj 


for any B e &™. The collection of all such measures, or, equivalently, the 
corresponding distribution functions, is known as the family of finite-dimensional 
( fi-di) distributions for the field X. In general it is the fi-di distributions that we 
work with in the study of a random field. Hence it is comforting to know that 
the fi-di distributions of any given random field uniquely define the P measure 
of all sets in the c-field 4*7. Not all events of interest are in 4:4, however. For 
example, sets of the form 


TEE 


{g: g(t) e B for all t e I} 


where B e # and I is an interval in Z^, are not usually in 4:4, although they 
are of obvious interest. To obtain the probability of such a set from knowledge 
of the fi-di distributions only it is necessary to assume some other condition, 
such as separability, which we shall encounter soon. 

The second definition of a random field, which is more natural in a modelling 
context, is to define it as a collection of random variables X(t), t e ^, together 
with a collection of measures or distribution functions of the form F,,... , on 
B", n = 1,2,...,t; € 4" which satisfy (1.5.1). The natural question to ask is 
whether or not one can always find a random field, according to the first 
definition, which possesses these measures as fi-di distributions. A celebrated 
result of Kolmogorov is that a necessary and sufficient condition for the existence 
of such a field is that the given family of measures satisfies the following two 
conditions: 


(a) Symmetry. Writing F as a distribution function Fy, |. ,(x,,..., X) F 
should remain invariant when the x; and t; are subjected to the same 
Permutation, 
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(b) Consistency. Fe... (Bx &™) = Fa 


Be gn 


(B) for every n, m 2-1 and 


etnim 


sts 


Clearly, not every family of measures corresponds to a random field. Hence- 
forth, however, we shall only deal with families that do. 

When dealing with random variables we called two variates X and Y equiva- 
lent if P{X = Y) = 1. This implies that, for all intents and purposes, these 
variates were indistinguishable. Two random fields X(t) and Y(t) are said to be 
equivalent fields, or versions of each other, if 


P{X(t) = Y} = 1 for every te A”. 


Two equivalent processes generate equivalent measures on €": but are not 
necessarily indistinguishable in every sense. For an example of this, set N — 
= 1, Q = [0, 1], and 


P{B} = Í dA(x). 


Bo[0, 1] 


Then define the two processes 


(1.5.1) X(t,0) = 0 for all t, œ 
0 t zoo 
Y = 
(1.5.2) (t, c) f t-o 


Then X and Y are clearly equivalent processes, but whereas 
P{X(t) is continuous for t € [0, 1]} = 1, 
P{ Y(t) is continuous for t € [0, 1]} = 0. 


Thus certain properties of the processes are quite different. 

For a given w e Q, X(-, œ) is simply a deterministic 4^-valued function on 
AN, which we refer to as a realization of the field X. The set in " *? determined 
by the points {(t, X(t)), t € 2%} is called a sample function, or sample path, of X. 
Thus the difference between the equivalent processes X and Y defined above 
could be said to be a difference in sample path behaviour. Since the sample path 
behaviour of random fields is the central concern of this book, and since this is 
not necessarily determined by fi-di distributions which are the basic building 
blocks with which we shall work, it is necessary to introduce the assumption 
of separability. Basically, this condition, due initially to Doob (1953), ensures 
that fi-di distributions in fact do determine sample function properties by 
requiring that sample functions are essentially determined by their values on an 
everywhere dense, countable set of points in the parameter space A". More 
formally, we have the following definition. 
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Definition 1.5.1 


A random field X(t), te A”, is said to be separable if there exists a countable 
set D c 4?" and a fixed event N for which P{N} = 0, such that for any closed 
interval B c 4^ and open interval I c @ the two sets 


{o: X(t, o) e B,te I) and (o: X(t, o)eB,telI ^ Dj 
differ by a subset of N. 


A simple consequence of separability when d = 1 is that the following two 

equalities hold with probability one: 

sup X(t) = sup X(t), inf X(t) = inf X(t). 

teDT teT te DT teT 
It is easy to see from this that the process Y(t) defined by (1.5.2) is not separable. 
Throughout the remainder of this book we shall assume, without further 
statement, that we are dealing with separable fields only. This is not a serious 
assumption, since it can be shown that to every stochastic process there cor- 
responds an equivalent separable process. 

We conclude this section by introducing an extremely powerful concept in 
the study of random fields known as homogeneity. 

This concept and some of its implications are discussed more formally, and in 
more detail, in the following chapter. However, since we shall require the concept 
before then, we shall now agree to call a real-valued random field X(t) strictly 
homogeneous (or stationary) if, for arbitrary k, any real numbers x,,..., x, and 
any (k + 1) points c, t,,...,t, in Z" the following condition on its fi-di distri- 
butions is satisfied: 


PiX(t;) s X», X(t) < Xp} = P{X(t, + T) s Xp -> X(t; + T) s Xu} 
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An important special class of random fields is the class of Gaussian fields. 
It will soon become clear that in many ways this is primarily a book about 
Gaussian random fields, in that, whenever we attempt to derive explicit solu- 
tions to difficult problems, we are usually only able to do so if the field that 
generated the problem is Gaussian. The main reason for this is the convenient 
analytic form of the multivariate Gaussian density, which makes it possible to 
obtain explicit results for Gaussian fields that seem almost impossible to derive 
for more general processes. 

A real-valued random variable X is said to be Gaussian (or normally distri- 
buted) if it has finite mean u = E{X} and variance g? = E(|X — u|?) > 0, and 
its distribution function is given by 


< a” 


x — 2 
(1.6.1) F(x) = PIX < xj = f Qno?) 42 e| - 1G JD IE: 


— 
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Equivalently, X has the density function 
fx) = Qno?) 1? e| - di 
and characteristic function 
x(t) = exp(itu — 3t?o?). 


We abbreviate this by writing X ~ N(u, o?). The case u = 0, o? = 1 is rather 
special, and in this situation we say that X has a standard normal distribution. 
We shall always use D(x) to denote its distribution function, i.e. 


(1.6.2) (x) = Qn) !? f exp( — ix?) dx. 


An &*-valued random variable X is said to be multivariate Gaussian if for 
every d-tuple of real numbers («,,..., %4) the real-valued variable Y = V2. , o; X 
is Gaussian. In this case the probability density of the d-dimensional vector X 
is given by 


(1.6.3) x(x) = Qn) "?|V| !? exp -3(x — V^ (x — 97] 


where p is the d vector with elements u; = E{X,} and V is the non-negative 
definite d x d covariance matrix with elements 


oy = FRx, = E(XG — MX; — up). 


Note that Gaussian distributions are completely determined by their first- 
and second-order moments. 

It is relatively straightforward to check from the definition that the correspond- 
ing characteristic function is simply 


(1.6.4) y(t) = exp(itu" — 4tVt*). 
Many important properties of multivariate Gaussian variables 
X = (X,,...,X4) 


relate to the distribution of subsets of the X ;. For example, any n, 1 < n < d, 
of these variables also have a joint (n-dimensional) Gaussian distribution. Thus 
if we partition X into (X+, X?) = ((X ,,..., X,,(X,.,,..., X), pinto(g!, p?) = 
(44, wn > Hn) (Hn + losers Ha)), and V into 


where V,, is ann x n matrix, then each X! has mean vector p! and covariance 
matrix Vj. Furthermore, the conditional distribution of X! given X? is also 
Gaussian, with mean vector 


(1.6.5) Bic gu c Vi, Va 7 OC o any 
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and covariance matrix 

-1 
(1.6.6) V2 = Var — Via V22 Vase 


We can now define a Gaussian random field to be a random field possessing 
fi-di distributions all of which are multivariate Gaussian. From this definition 
and the above comments it is clear that all the fi-di distributions of a real- 
valued Gaussian process, and hence the measures they induce on 4":4, are 
completely determined once we specify the following two functions, known, 
respectively, as the mean and covariance functions: 


w(t) = E{X(0}, 
R(s, € = E([X(s) — w(s)] TX) — ny}. 


It can be easily seen from the form of the multivariate normal density that if a 
real-valued Gaussian field has constant mean and a covariance function that is 
dependent on s — t only, then the field is homogeneous. 

A final, important, fact about real-valued Gaussian variates is that if X and Y 
have a bivariate Gaussian distribution and their correlation pyy is zero, then 
X and Y are independent. This property will be of considerable use to us in the 
future. 
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We have already noted that our central concern will be with the (random) 
excursion sets 


A,X, S) = (te S: X(t) 2 u} 


of a real-valued random field X(t), te 4". When N = 1 the field is simply 
a random function on the real line, and the sets A, become comparatively 
simple sets comprised of a number of disjoint closed intervals, some of which may 
be degenerate. The study of these intervals is a well-established area, which we 
shall review in Section 4.1. However, the one-dimensional theory of excursion 
sets is not actually based on these sets themselves, but is essentially concerned 
with the random point set of what are termed the upcrossings of a level u by 
a stochastic process. We define this concept as follows, for a general real-valued 
function, F. 


Definition 1.7.1: 


Let F(t) be a continuous function on [0, T] such that F(t) is not identically 
equal to u in any interval and neither F(0) nor F(T) equal u. Then F is said to have 
an upcrossing of the level u at the point to if there exists an > O such that F(t) € u 
in (tọ — & to) and F(t) > uin (to, to + £). 
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The number of such points to in [0, T] is called the number of upcrossings of u 
by F in [0, T], and is denoted by N (X, T) or simply N, when no confusion is 
likely. 

The relationship between the number of upcrossings of a continuous sto- 
chastic process and its excursion set A, = A,(X, [0, T]) is simple. If X(0) < u, 
then the number of upcrossings of u and the number of components of A, are 
identical, while if X(0) = u, the number of components of A, is one greater 
than the number of upcrossings. 

Since, in one dimension, level crossings provide the key to the study of 
excursion sets it is natural to ask whether or not we can study the same concept 
in the more general situation. The answer to this is relatively simple, but, 
unfortunately, negative. 

The generalization of the set of level crossings from one to many dimensions 
must involve random point sets of the form 


{te S: X( = u} = eA, S), 


where 0A denotes the boundary of the set A. In two dimensions, for example, 
0A, is a family of contour lines in the plane. Because of this, it is clear that there 
is no immediately obvious generalization to random fields of the rather simple 
notion of the number of level crossings of a one-dimensional process. It is 
not clear, for example, how to distinguish between the ‘upcrossings’ and - 
*downcrossings' of a random field. Simply counting the number of disjoint 
contour lines (N = 2) or (N — 1)-dimensional surfaces (N > 3) in 0A, seems 
reasonable at first, but it turns out that this variable is not readily amenable to 
exact probabilistic investigation, so that it is not possible to build a full theory 
based on these sets. Nevertheless, some aspects of these sets can be profitably 
studied, and we shall thus commence our study of random fields by considering 
some elementary properties of A, and 0A,. 

The simplest aspect of A, to study is its content, or Lebesgue measure, 
1,(A,). Defining the indicator function I 4(x) by 


(1.7.1) L(x) = f Ta 


0 ifx <0 
it is immediate that 
(1.7.2) Ay(A,(X, S) = [Lew — u) dt 
S 


for any real u and an arbitrary set $ c 4". This representation permits a simple 
derivation of some of the statistical properties of the content of A,. For example, 
using this we have 


(1.7.3) E{AMA,)} = [et a X(t))} dt 
s 


[Pixo u} dt. 


wy 
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If the random field is homogeneous the integrand is independent of t so that in 
this case 


(1.7.4) E{Ay(A,)} = AKS)P(X(0) > uj. 

Similarly, it is not difficult to write down expressions for higher moments of 
Ay(A,). For example, 
(1.7.5) E(A3(A) = [ [Px > u, X(t) > u} ds dt. 


Let us now assume that the random field X(t) is a zero-mean Gaussian process 
with unit variance and covariance function R(s, t), and see if the expressions in 
(1.7.4) and (1.7.5) can be simplified. Equation (1.7.4) simplifies to 


E{A,(A,)} = AK): [1 — 9(0)]. 


where ® is the standard normal distribution function defined by (1.6.2). Since 
@(u) is a well-tabulated function, this is a simple expression and the result 
is a useful one. If we now write $(x, y: p) to denote the bivariate Gaussian 
density 


1 x? F 2pxy + “| 
x, y: p) = ——————À ex E APRIRE cheat E 
WARME pem | 20) 


then equation (1.7.5) becomes 


(1.7.6) E(A2(A) = T i (x, y; R(s, t)) dx dy ds dt 


When u = 0 we can further simplify this expression to 


(1.7.7) E(2(A,)} = Í Í d ses RG, 2n didi 


lias) To Ji [ arcsin[R(s, t)] ds dt 
SvS 


[cf. Kendall and Stuart, 1969, p. 351]. It may of course be possible to evaluate 
the integral in the above expression explicitly for a specific covariance function 
R, in which case one obtains a simple expression for the second moment of 
Ax(Ao). However, for any value of u other than zero, the integral 


i [ 4 y; p) dx dy 


appearing in (1.7.6) cannot be explicitly evaluated and no further analytic 
simplification of (1.7.6) is possible. A similar situation exists with regard to 
third moments; i.e. for u = 0 it may be possible (depending on R) to obtain an 
explicit formula for E(A$C44)1, while for u # O it is not possible. However, it is 
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never possible (even ifu = 0) to obtain explicit expressions for higher moments. 
(For the reasons for this, see Kendall and Stuart, 1969, pp. 351—354). 

This is a first introduction to a problem that will continually plague us. In 
general, it turns out to be virtually impossible to obtain explicit expressions for 
moments other than the first of most of the random field generated variables that 
we shall consider. It will also turn out, a fortiori, to be virtually impossible to 
obtain exact distribution results, except in certain limiting cases, some of which 
will be discussed in section 6.9. 

The second, and most obvious, property of excursion sets to study is the size 
of their boundaries, 0A,(X, S). Since 0A, is in general an (N — 1)-dimensional 
manifold in Z^, its size is best measured with Ay. , measure, so that the variable 
of interest is now Ay. ,(04,). As for Ay(A,), very little can be determined 
about the distribution of this variate, other than its first two moments, and even 
to obtain these it turns out to be necessary to call in other branches of mathe- 
matics. Thus, let us digress for a short while. 

We shall restrict ourselves for the moment to the two-dimensional case in 
which 04A, is simply the set of contour lines of X(t) in S and 4,(0A,) is their 
length. To obtain a usable expression for this we need a result from integral 
geometry. Firstly, we note that any straight line in the plane can be param- 
eterized by two coordinates (p,0), -œ € p « o0,0 x 0 « n, so that the 
equation of the line will be 


(1.7.8) ti cos 0 + t; sin @ — p=0. 


This line is of perpendicular distance p from the origin, and the angle between 
this perpendicular and the positive half of the t, axis is 0. Now, for compact 
S c & set N,(p, 0) to be the number of intersections of the line given by (1.7.8) 
with the set 04,(X, S). Then from a basic result of integral geometry (see Lemma 
6.4.1), we have the following, which holds if the sample paths of the field X 
satisfy the smoothness conditions called ‘suitable regularity’ which are discussed 
in detail in Chapter 3: 


(1.7.9) 1,(0A,) = 4 ia N,(p, 0) dO dp. 
07—o 


From this it immediately follows that 


(1.7.10) E(AQ0A) = 1 (ele E{N,(p, 0)) dO dp. 
07—o 


The value of this expression lies in the fact that the expectation in the integrand 
relates to the mean number of level crossings of the one-dimensional stochastic 
process obtained by restricting X(t) to the line (1.7.8) and, under the conditions 
to be discussed in Section 4.1, an expression for this is well known. We shall 
carry out the integration of (1.7.10) in Section 6.4, but let us note here that even 
when this can be done the final expression involves, in all but the simplest cases, 
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extremely awkward expressions. Similarly, although it is theoretically feasible 
to use (1.7.9) to obtain higher moments of 4,(04,), trying to do so, even for the 
second moments, results in final expressions that are always intractable in- 
tegrals. 

Apart from these purely technical drawbacks of studying the variable A«(0A,) 
there is a more fundamental problem which lies in interpreting results about it. 
For example, suppose we are told that for a particular random field X(t) defined 
on the unit square that E(A,(0A4,)) = 25. Does this mean that each excursion 
of X(t) has an average boundary length of about 25, or do there tend to be 25 
excursions per unit square, each with an average boundary length near one? 
The answer is not clear. 

Problems of this type first led Swerling (1962), and later Bolotin (1969) and 
Nosko (1973), to study the number of connected components contained in the 
excursion set of a two-dimensional random field. The results they were able 
to obtain were in a certain sense incomplete. For example, it was not possible to 
obtain the mean value of this random variable. Although bounds for the mean 
value could be obtained these bounds were not, however, very sharp, and in 
general involved expressions that could not be explicitly evaluated. In one 
sense, however, the work of these three authors is rather important, for in 
trying to study the topological concept of connected components they used some 
elementary ideas from the theory of differential geometry. As will become clear 
later it is through the related fields of integral geometry and differential topology 
that the problem of studying level crossings in higher dimensions can be suc- 
cessfully tackled. 

A large variety of other concepts has also been used to generalize the notion 
of level crossings to random fields, and we shall have cause to meet some of these 
later on. However, at this stage it is desirable to divert ourselves from this area 
for a while to devote some time to studying more fundamental problems. We 
commence in the following chapter by studying the spectral theory of homo- 
geneous random fields. This theory, although it possesses little intrinsic interest 
for us, will provide a number of very powerful tools that we shall use extensively 
later on. 

In Chapter 3 we consider various types of sample path regularity. So far we 
have always assumed that our random fields have almost surely continuous 
sample paths, and in later chapters further conditions will be imposed. Chapter 3 
is devoted to determining conditions under which the regularity conditions we 
shall require will in fact hold. Chapter 4 is devoted to setting out the geometry 
and topology we shall need to properly describe and classify excursion sets of 
both random and non-random functions. Finally, in Chapter 5, we shall be able 
to return to the problem of central interest: obtaining distributional properties 
of the excursion sets of random fields. 


CHAPTER 2 


Homogeneous Fields and Their Spectra 


We have already had a brief introduction to the concept of homogeneity 
for random fields in Chapter 1, but before we can progress further in studying 
the problems raised there it is necessary to devote some time in studying this 
concept and some of its ramifications in more detail. Although most of the 
results we are about to encounter will be extremely useful later on, the details of 
their proofs will not, so that they will not be given. The reader already familiar 
with the spectral theory of single-parameter processes will find little notably 
different in the spectral theory of random fields as discussed in the following 
four sections of this chapter. The level of treatment is such that a reader un- 
familiar with spectral theory and willing to accept results without proofs should 
not find the material hard to follow. 

A full and self-contained proof of the spectral representation theorem 
(Theorem 2.4.1) is given by Miller (1975), who uses a mathematically un- 
sophisticated style of proof and obtains the theorem only under stronger con- 
ditions. However, his approach may be preferred by a reader not familiar with 
the Hilbert space theory that most other proofs rely on. A spectral theory for 
processes defined on more general spaces than Z" is developed by Gihman and 
Skorohod (1974), who study processes defined over complete, separable, 
metric spaces, while Yaglom (1961) studies processes defined over certain 
separable, locally compact groups. Despite obvious notational complexities, 
all these theories are essentially the same as those for processes defined on 2+, 
full accounts of which are given, for example, in the monographs of Cramér and 
Leadbetter (1967) and Yaglom (1962). The spectral theory of homogeneous 
random fields was first rigorously established in Yaglom (1957). 


2.1 THE MEAN AND COVARIANCE FUNCTIONS 


Although our primary interest lies in the study of real-valued random fields 
it is mathematically more convenient, and often more useful, to develop spectral 
theory for complex-valued processes. Hence, unless otherwise stated, we shall 
assume throughout this chapter that X(t) takes values in the complex plane C, 


mI 
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while t lies in 2". We shall also assume that the expectation E | X(t)|? is finite for 
all t e BY. 
With these assumptions the mean function 


m(t) = E(X(0j 


now becomes a non-random function from 4" to C. We define the covariance 
function R(s, t): 27% > C by 


(2.1.1) R(s, t) = E(LX(S) — m()]LX(t) — m)) 


where the bar denotes complex conjugation. When X(t) is a real-valued field 
the conjugation has no effect, so that the above definition is equivalent to that 
given in Chapter 1 for real-valued processes. 

It is an immediate consequence of (2.1.1), the finiteness of E| X(t)|?, and the 
Cauchy- Schwartz inequality that R(s, t) is finite for all s, t € 2”. Furthermore, it 
is easily seen that R(t, t), which is the variance of X(t), is real and non-negative, 
and that the covariance function satisfies the relation 


R(s, t) = R(t, s) 


for all s, t e Z”. Indeed, much more than this is true, for it is also easy to see that 
every covariance function is a non-negative definite function on ?"'; i.e., if 
t,,...,t, is any collection of points of A~, and z,,...,z, are arbitrary complex 
numbers, then the Hermitian form 

k k 


Q.1.2) Y XR(tt)uz, 
i-1 j=1 
is always real and non-negative. This follows from the fact that this expression 


is equal to 
2 


k 
E y [X(t;) — m(t;)]z; 
i=1 

which must, of course, be real and non-negative. The property of non-negative 
definiteness actually characterizes covariance functions, so that given any func- 
tion m(t): 4^ > C and a non-negative definite R(s, t): 22% > C it is always 
possible to construct a random field for which m(t) and R(s, t) are the mean and 
covariance functions, respectively. Indeed, this field can even be taken to be 
Gaussian, since the multivariate Gaussian fi-di distributions generated by m 
and R are easily seen to satisfy the Kolmogorov conditions of Section 1.5. 

As we have already noted, a central concept in the study of random fields is 
that of homogeneity. We say that a random field X(t) is strictly homogeneous (or 
strictly stationary) if its finite dimensional distributions are invariant under 
translations in the parameter t. That is, for any set of points t,t,,..., t, in 4" the 
joint distribution of the k complex variables 


X(t), X(6)..... X(t) 
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should be the same as for the variables 
X(t, + 9), X(t, + 0$,..., X(t, + 1) 


An immediate consequence of this property is that the mean function m(t) 
is identically equal to a constant, which we shall hereafter take to be zero. This 
assumption reduces notational complexity for no real loss of generality. A 
second consequence of strict homogeneity is that the covariance function 
R(s, t) is forced to be a function of the difference s — t only. 

Very often, when dealing with random fields, it is not necessary to impose the 
rather restrictive condition of strict homogeneity, and it suffices to demand only 
those two of its consequences just noted. Hence, we shall introduce a class of 
fields which we shall call simply homogeneous or stationary, which satisfy 


(2.13) E(X(t)) = m = constant, 
C E([X(s) — m][X(t) — m]) is a function of s — t only. 


These fields are also known as second-order, wide-sense, or weakly homogeneous 
fields. In this situation the covariance function becomes a function of N variables 
only (rather than 2N). Nevertheless, we shall continue to denote it by R(t), 
allowing ourselves a slight but convenient and conventional looseness in 
notation, so that we can now meaningfully write R(s, t) = R(s — t). 

A strictly homogeneous field is clearly homogeneous, but in general the reverse 
is not true. However, it is true for certain Gaussian fields which satisfy the 
following condition. 


Theorem 2.1.1 


Inorder that a homogeneous, Gaussian random field be strictly homogeneous it is 
necessary and sufficient that the function 


H(s, t) = E(X(S)X(0; 


should be a function of s — t only. 


Note that if X(t) is complex valued H is not its covariance function, since we 
do not take the complex conjugate of the second factor in the expectation. On 
the other hand, if X(t) is real valued H = R, so that as a corollary to the theorem 
we have that every homogeneous real-valued Gaussian process is also strictly 
homogeneous. 

We conclude this section by noting that for homogeneous fields the Hermitian 
form given in (2.1.2) simplifies to 


k k 
Y Y R(t;  t)7;2;. 
1j 14 
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Since this must be non-negative for any set of t; € 4" and complex numbers z;, 
we have that the covariance function, considered now as a function on A”, 
is non-negative definite. It is useful to know that such functions can be character- 
ized, as in the following result due to Bochner (1933). 


Theorem 2.1.2 


A continuous function R(t) from 4" to the complex plane is non-negative 
definite if and only if it can be represented in the form 


(2.1.4) R(t) = Í exp(it - 4) dF(4), 
AN 


where t - denotes the inner product Y7- t;2; and F(A) is a bounded, real-valued 
function satisfying f a dF(X) = 0 for all measurable A c A”. 

Since every covariance function is non-negative definite, each such function 
has a representation of the form (2.1.4). The function F of the theorem is only 
specified up to an additive constant. When we fix F by demanding that 


F(—00,...,—00) 2 0 


(thus implying that F(oo,..., oo) = R(0)) we call (2.1.4) the spectral representa- 
tion of R(t). The function F(A) is then called the spectral distribution function for 
R(t) and is effectively a multivariate distribution function multiplied by R(0). 
When F(A) is absolutely continuous we call the corresponding density, f (A) say, 
the spectral density and (2.1.4) becomes 


Q.1.5) R(t) = [expat MFO) dà. 


As we shall see later, these representations of the covariance function are 
extremely helpful in investigating its properties. 


22 CONTINUITY AND DIFFERENTIABILITY OF RANDOM 
FIELDS 


Questions about continuity and differentiably of a function f(t), t e 2" at a 
point t* essentially boil down to questions about the convergence of sequences of 
the form {f(t,)} when lit, — t*|| ^ 0asn— oo. When the function being studied is 
actually a random field it follows that we are actually asking questions about 
the convergence of a sequence ( X(t,)) of random variables. Hence, just as there 
are various modes of convergence for random variables, there are various types 
of continuity and differentiability for random fields. We shall consider only two 
of them. 

The strongest form of stochastic convergence is almost sure convergence. 
Corresponding to this, we shall say that a random field X is almost surely 
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continuous at t* if for every sequence {t,,} for which ||t, — t*|| > 0 as n > oo, 
X(t) = X(t*). Alternatively, we could write this condition as 


Pío: ||X(t,, @) — X(t*, @)|| 2 0 asno o}= 1. 


We say that X is almost surely continuous throughout a set A c Æ" if it is 
almost surely continuous at each t e A. We shall also refer to this type of con- 
tinuity as sample function continuity. 

Similarly, if we write f; to denote the jth first-order partial derivative 0f/0t; 
ofa function f, we say that arandom field X is a.s. or sample function differentiable 
at a point t* if Pio: X (t*, œ) exists} = 1. Let 6,, 1 <j € N, denote the N 
vector, all of whose elements are zero, except for the jth, which is one. Then if X 
is almost surely differentiable at t* the a.s. limit 


XA) - lim X(t* + hà) — X(t*) 
h} 0 h 


is called the sample function jth partial derivative of X at t*. By allowing t* to 
vary we obtain the partial derivative field X (t). 

In what follows we shall generally wish to use the information contained in the 
covariance function to investigate the behavior of random fields. Thus, because 
covariance functions are essentially only second-order moments, if often turms 
out that the most natural form of convergence to use in the theory of random 
processes is mean square convergence. We shall now see how this mode of 
convergence can also be used to develop notions of continuity and differentia- 
bility for random fields, and how these relate to the covariance function. 

Again, let t,, t;,... be a sequence of points and t* a fixed point in 4" for 
which |t, — t*|| > 0 as n > oo. Then if 


X(t) ^S X(t*) as n >o, 


we say that X is continuous in mean square at t*. If this holds for all t* € A, where 
A is some subset of Z”, we say that X(t) is continuous in mean square over A. 
It is interesting to note that whether or not a field possesses this properly can 
be read off from the covariance function of X(t), as in the following theorem. 


Theorem 2.2.1 


A random field X(t) is continuous in mean square at the point t* € A” if and 
only if its covariance function R(t, s) is continuous at the point t = s = t*. 

If R(s, t) is continuous at every diagonal point s — t then it is everywhere 
continuous. 


It must be emphasized that continuity in mean square does not imply sample 
function continuity. The latter is a far stronger condition which relies on much 
more specific behavior by the covariance function. We shall consider this in 
more detail later. 
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We now consider differentiability in mean square, for which we assume that 
the field X is real valued. It is not hard to establish the following result. 


Theorem 2.2.2 
If the derivative O^ R(s, t)/ðs; Ot; exists and is finite at the point (t, t) € R”, 
then the limit 
X(t + hd) — X(t) 
h 
exists, and X (t) is called the mean square (m.s.) derivative of X(t) at t. If this 


exists for each t € 4" then X(t) is said to possess a m.s. derivative. The covariance 
function of X (t) is then given by O?R(Ss, t)/0s, ôt. 


(2.2.3) Xt) = lim. 
h20 


One can, of course, continue further in this direction, defining, for example, 
second-order derivatives X;t), 1 < ij < N, by 


i j) t 6) — X(t kô; X(t 
E Osim a te a )— X(t + kd) + XO 
h,k> 0 


which will have the fourth-order derivative 0^R/Os; ôt; 0s; Ot; as its covariance 
function. 

Finally, we note that when the basic process X(t) is homogeneous, the con- 
ditions ensuring m.s. continuity and the existence of m.s. derivatives become 
particularly simple. For example, by Theorem 2.2.1, X(t) will be m.s. continuous 
at s if its covariance function is continuous at (s, s). But because of homogeneity, 
this is equivalent to demanding that R(t) be continuous at s — s = 0. Hence, 
if R(t) is continuous at the origin, X(t) will be m.s. continuous for all t e 2”. 
Similarly, if the 2kth-order partial derivative 0? R(t)/0?t;, - - - 0?t;, exists and is 
finite at the point t = 0 then the kth-order partial derivative X;; - ;, (t) exists for 
all t € Z” as a mean square limit. 

Implicit in these results is an indication of the fact that for a homogeneous 
random field the behaviour of its covariance function in the neighbourhood of 
the origin may be a determining factor in regard to mean square local properties 
(continuity, differentiability, etc.) of the field. That is in fact the case will become 
abundantly clear later on, when we shall see that at least for homogeneous 
Gaussian random fields this aspect of the covariance function determines 
virtually every aspect of the local behaviour of the sample functions of the field 
that we shall be interested in. Indeed, this 1s even the case when we consider 
almost sure local properties. 


2.3 STOCHASTIC INTEGRATION 


We shall not attempt in this section to give anything like a full treatment of 
stochastic integration on A", but shall limit our treatment to noting the existence 
of certain Fourier-type integrals, We again refer the reader interested in proofs to 
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the monographs of Cramér and Leadbetter (1967) in which the theory of 
stochastic integration for 4?! is treated in detail, or Gihman and Skorohod (1974) 
for the N-dimensional theory. Miller (1975) also contains a full account of the 
N-dimensional theory that we require. 

We commence by defining a particular class of random fields. Given any 
complex-valued random field Z(t), t € Z”, it is possible to define a random 
additive set function on the set of finite unions of intervals of 2" in the following 
fashion. Let I = (a,, b] x --- x (ay, by] be an interval in 2”, and set 


(2.3.1) Z(l) = Z(b,,.... by) 

— [Z(a,, b5,..., by) +--+: + Z(by,..., by 1, ay) ] 

+--+) + (—1)9Z(q,,..., ay). 
This defines Z(I) for any interval I c 4". The definition is extended to the 
class .s/" of sets A c A” which are finite unions of intervals of A” by demanding 
additivity, i.e. 

Z(A o B) = Z(A) + Z(B) 

if A, B c £" and A ^ B = $ (the empty set). 

Now suppose that X(t), considered as a random field, has finite variance for all 
t e A”. Then, by (2.3.1), E | Z(I)|? will also be finite for all 7 = 2". We then call 
Z(t) a field with orthogonal increments if for every pair of disjoint intervals 
L, l, c A, 

(2.3.2) E{Z(1,)Z(7,)} = 0. 

An immediate consequence of (2.3.2) is that such a field determines a measure on 
sf", generally denoted by F(I) and defined by 

(2.3.3) FU) = E|Z()/?. 

Such a measure will, up to a constant, determine a point function on 4 
that is non-decreasing in each tj, 1 <i € N. If we also use F to denote the point 
function, set F(— «,..., —o0) = 0, and demand that F is bounded over the 
whole of Z^, then F(t) has all the properties of a multidimensional distribution 
function multiplied by an arbitrary constant. The relationship between F as a 
point and set function is also of the form given in (2.3.1) for Z. Hereafter we shall 
assume, without explicit mention, that F(t) is bounded for all t e 2. 

For random fields of the above form it is possible to define a wide class of 
stochastic integrals known as mean square (m.s.) integrals, a subset of which is 
given in the following theorem. 


Theorem 2.3.1 


If Z(t) is a complex-valued random field with orthogonal increments, then for 
every t e Ay the Riemann Stieltjes integral 


(2.3.4) X [ dQ) dZ() 
ar i 
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is well-defined as 
An 
lim lim Y gQ,)ZCUn)) 
Ao nro j=1 


for all complex-valued g for which 
(2.3.5) i 1900)? dF(A) < oo. 
AN 


In defining the mean square limit we partition, for each A > 0 and n > 1, the 
interval (— A, AJ” into A disjoint intervals 1,5, 1 < j < A,,in such a way that 


sup Ay(,;) 5 0 as n ow. 
1<j<An 


By setting g(à) = exp(it - 4) we obtain from (2.3.4) a family of random variables 


(2.3.6) X(t) = f exp(it -2) dZ) 
| an 


which is well defined for any Z for which 
sup E{|Z(D} « o. 
AN 


The complex-valued field X defined in this fashion has rather special properties, 
for if we set F(t) to be the function defined by (2.3.3) and assume for the moment 
that E{Z(t)} = 0 for all t, we have the following theorem. 


Theorem 2.3.2 
For the random field X(t) determined by (2.3.5), 


E(X(t) = 0 d 
(2.3.7) Eu i 


R(s, t) = E(X(sSX(t) = [evt — t)- X] dF), 


where the integral is a standard Riemann-Stieltjes integral. 


An immediate implication is that since the covariance function R(s, t) of 
X(t) depends only on the difference s — t, the random field defined by the 
stochastic integral (2.3.6) is homogeneous. Furthermore, the spectral distribu- 
tion function of X(t), as defined in (2.1.4), is simply the function F(A) determined 
by the field Z(t) in (2.3.3). This automatically leads us to question whether the 
fact that an integral such as (2.3.6) leads to a homogeneous field whose co- 
variance function satisfies (2.3.7) implies that the converse holds; i.e. given a 
homogeneous field whose covariance function satisfies (2.3.7) can we represent 
itas a stochastic integral as in (2.3.6)? 
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We now proceed to give a positive answer to the above question. Suppose 
X(t), te 4", is a homogeneous, complex-valued random field for which 
E(X(t)) = 0 and E{|X()|?} < oo. Furthermore, let X(t) be continuous in 
mean square and let R(t) denote its covariance function. Then, by Theorem 
2.2.1, R(t) is continuous for all tand so by Theorem 2.1.2 has a representation of 
the form 


(2.4.1) R(t) = NTC : X) dF) 


where F(A) is the spectral distribution function of X(t). The following theorem 
is known as the spectral representation theorem. 


Theorem 2.4.1 


For every mean square continuous, zero-mean, homogeneous random field 
X(t) there exists a field Z(t) with orthogonal increments such that for each t, X(t) 
has the following representation as a mean square integral: 


(2.4.2) X(t) = NTC - X) dZ). 


The field Z(t) is defined up to an additive constant. If this is fixed by setting 
Z(—900,...,—00) = 0 we have 


(2.4.3) EZ()-20, E|ZQ) = FO) EZ)? = FQ) 


where I is any interval in @ and F is determined by (2.4.1). 


The representations of R(t) and X(t) given, respectively, by (2.4.1) and (2.4.2) 
are fundamental to the study of homogeneous fields. We shall now investigate 
some of their simpler implications for real-valued random fields. 

The representation of the covariance function of X(t) can be rewritten as 


R(t) = f cos(t- A) dF) + i f sin(t - X) dF(A). 
AN AN 
If the field X(t) is real valued, then so is R(t), so that 
(2.4.4) f sin(t - 4) dF(A) = 0 for all t. 
AN 


This fact immediately implies that the spectral distribution is symmetric about 
the origin of 4", in the sense that for any 4;,v;,1 € i <N, 


F(A vy) x oe x Aye vp - FA, ova] xee x -An v 
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If a spectral density function f(A) exists, it is also symmetric, in the sense that 
fQ) = f(A) for all X € Z”. Consequently, the odd-ordered moments of F, 
when they exist, are zero; i.e. 


N 
(2.4.5) f diab... av dF) =0 if Și is odd. 
gN j=1 


Another useful consequence of (2.4.1) is the existence of relationships between 
the even-ordered moments of F and the behaviour of R(t) near t = 0. For 
example, 


(2.4.6) f dF(A) = R(0). 

GN 
Furthermore, if we write 4;; to denote the second-order spectral moment 

I Aid dF) 

AN 
and set R;j(t) = 0?R(0)/ót; ot; then 
` O^ R(s, t) 

2.4.7 = = —R.(0 
(2.4.7) ae 0) 


In general, the 2kth-order spectral moments are equal to the appropriate 
2kth partial derivative of R(t) at the origin times ( — 1)*. 

From Section 2.2 we have that the mean square derivative X(t) of X(t), 
if it exists, has the partial derivative R;(t) as its covariance function. Since the 
variance of X ,(t) is then given by R;,(0), in view of (2.4.7) we have 
(2.4.8) E(|X(0)) 2A; fori=1,...,N. 

Similarly, the variance of a kth-order mean square derivative of X(t) will be 
given by the appropriate spectral moment of order 2k. Indeed, one can easily 
derive the following general result: 
O**PX(t) PXW 
OU 0Pt, Ot, Mt, 
QuiBtrtó 


— 6t, Pt; O't, Ot 


R(t) 


t=0 
= (Caere AUALA AF (A). 
AN 


Combining this with (2.4.5) it follows from an appropriate choice of «, f, y, ô that 
for a real-valued homogeneous field X(t) the following relationships hold for 
alll <ikl<N: 


(24.9) X(t) and X(t) are uncorrelated (Bf =y=6=0,a = 1), 
(2.4.10) X(t) and X,,(t) are uncorrelated (a: y> dd - 1, f - 0). 


32 HOMOGENEOUS FIELDS AND THEIR SPECTRA 2 


If X(t) is a Gaussian field than so are its mean square derivatives. (This is a 
simple consequence of the fact that derivatives are defined by taking limits of 
linear combinations of Gaussian variates. See equation 2.2.3.) Furthermore, 
the joint distribution of X(t) with its derivatives is multivariate Gaussian. 
Thus in this case the variables considered in (2.4.9) and (2.4.10) are not only 
uncorrelated but also independent. We shall make continued use of this fact 
in later chapters. 

We conclude this section with a brief discussion of the heuristic meaning of 
the spectral representation of a random field. Firstly, note that if X(t) is real 
valued we can rewrite (2.4.2) as 


(2.4.11) X(t) = f [cos 4 - t dU(A) — sin X - tdV()] 
AN 


where U(t) and V(t) are real-valued fields defined by 
(2.4.12) Z(a) = U(A) + iV(A). 


That is, U and V are the real and imaginary components of Z. Now approximate 
the integral in (2.4.11) by a sum of the form 


(2.4.13) Y. {cos(h; - U(A) — sin; - V(A)) 


where the A; are disjoint intervals in 4" and i; € A;, i = 1, 2,.... Then it is 
clear from this approximation that the spectral representation of a random 
field is effectively a method for breaking it up into a large number of ‘sinusoidal’ 
components. 

In the one-dimensional situation the basic components in (2.4.13) are simple 
sine and cosine waves of (random) amplitudes — V(A,) and U(A,), respectively, 
and wavelengths equal to 2z/A;. In higher dimensions the elementary components 
are slightly harder to visualize. Consider the two-dimensional case. Dropping the 
subscript on À; for the moment we have that an elementary cosine wave is of the 
form cos(A,t, + 22t2). The 4; are of course fixed, and the point (t,, t,) takes 
values in #7. A little elementary trigonometry shows that such a function 
actually forms a ‘wave-pattern, in #* of a sequence of waves travelling in a 
direction which makes an angle 


(2.4.14) 0 = tan” (2) 
with the t, axis, and having the wavelength 

2n 
VJ+ 


as the distance between troughs or crests, as measured along the line perpendicu- 
lar to the crests, A pictorial representation of such a surface is given in Figure 
2.4.1, 


(2.4.15) qus 


2.5 ISOTROPIC FIELDS 33 


Figure 2.4.1 Elementary wave form cos(A,t,; + À 51) in??. The crests of the waves lie on 
the lines shown 


The corresponding sine function is exactly the same, except that its crests lie 
on the top of the troughs of the cosine function, and vice versa. That is, the two 
sets of waves are out of phase by half a wavelength. As in the one-dimensional 
case, the amplitudes of the components cos(A, - t) and sin(A, - t) are given by the 
random variables U(A;) and V(A)). 

This concludes our study ofthe fundamental aspects ofthe spectral representa- 
tion for random fields. All of these have been closely analogous to corresponding 
results for one-dimensional processes. We now turn to a phenomenon which, 
although essentially vacuous for processes on the line, opens up a number of 
interesting problems for fields. 
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An interesting special class of homogeneous random fields that often arises in 
applications is theclass of isotropic fields. These are characterized by the property 
that the covariance function depends only on the length |t|| of the vector t 
so that 


(2.5.1) R(t) = R(|t||). 


This restriction has a number of surprisingly varied implications for both the 
covariance and spectral distribution functions, which we shall explore in a 
moment. 
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The concept of isotropy essentially arises from areas of application of random 
fields in which there is no special meaning attached to the axes of # being 
used. For example, if one were to model a geographical terrain as a random 
field it would be usual to take axes in the north-south and east-west directions. 
However, the local terrain of a small region may well appear to be totally 
independent of direction, so that the covariance between the height above sea 
level of two points in the region would be most appropriately modelled as 
depending only on their distance apart, and not on the direction from one to 
the other. Such an assumption leads to (2.5.1) and an isotropic field. 

Isotropy places a number of restrictions on the possible behaviour of the 
covariance function. For example, we have the following result, due to Matérn 
(1960). 


Theorem 2.5.1 


If R(t), te A~, is the covariance function of a zero-mean, isotropic random 
field X(t), then R(t) => — R(0)/N for all t. 


Proof 


Firstly, note that because of the isotropy of X we can write R as a function on 
the non-negative real line only. Let d be any positive real. We shall show that 
R(d) 2 —R(O)/N. 

Select (N + 1) points t,,...,ty,, of Z” for which |t; — t;|| = d for all i # j. 
Then, by (2.5.1), 

2 
= (N + D[R(0) + NR(d)]. 


N+1 


E| Y, X(t) 


Since this must be positive, the result follows. 


The restriction of isotropy also has significant consequences for the spectral 
distribution function F(A). Let $ be a rotation transformation from # to A”. 
That is, if we write 


$(t) = (61(0. .... (0), 


then we have 
N 
$t) = > ait; 
J=1 


where A = (aj) is an orthogonal matrix. Of course, ||@(t)|| = litl for all t. 
Isotropy implies R(b(t)) = R(t) for all t. The spectral representation (2.1.4) of 
R then implies 


(2.5.2) f expit 3) dFQ) = Í exp(it X) dF($ ^ '(2)) 
an aN 
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where $ ^! is the rotation inverse to @. Since (2.5.2) holds for all t it follows that, 
for any B € 4", F(B) = F($(B)). Hence if F possesses a density f, then f, like R, is 
dependent only on the modulus of its argument. Thus we have the following 
theorem. 


Theorem 2.5.2 


If X(t) is a mean square continuous, isotropic random field then its spectral 
density function f (A) is a function of ||.| only. 


An interesting consequence of this result is that an isotropic field cannot have 
all the probability of its spectral distribution concentrated in one small interval 
in 2". Thus, in particular, it is not possible to have a spectral distribution 
degenerate at one point, unless that point is the origin. The closest the spectral 
distribution of an isotropic field can come to this sort of behaviour is to have all 
its probability concentrated in a\narrow annulus of the form 


{ae Aa < lAl < bj 


for some 0 x a < b. In such a case it is clear from (2.4.14) and (2.4.15) that the 
field itself is then composed of a 'sum' of waves travelling in all directions but 
with wavelengths between 22/b and 2n/a only. 

Theorem 2.5.2 implies that under isotropy f reduces to a function of one 
rather than N variables, and so it is natural to ask whether the spectral repre- 
sentation itself can be simplified under these circumstances. The following 
result, due originally to Schoenberg (1938) (in a somewhat different setting) 
and Yaglom (1957), answers this question in the affirmative. 


Theorem 2.5.3 


For R(t) to be the covariance function of a mean square continuous, isotropic 
random field on A“ it is necessary and sufficient that 


© Jow- alà 
(2.5.3) R(t) = Í RP ip daO, 


where G(A) is a bounded non-decreasing function and J,, is the Bessel function of 
the first kind of order m; namely 


o0 2 2k+m 
Jm(X) = p (-1* (x/ ) 
=0 


K!T(K + m+ 1) 


Proof 


The proof consists in simplifying the representation 


(2.5.4) R(litl) = R(t) = f explit - 2) dFA) 
aN 
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by using the symmetry properties of F. We commence by converting to polar 
coordinates, through which we identify each X € # by its distance ||A|| from 0 
and the point on the (N — 1)-dimensional unit sphere S"^! intersected by the 
line connecting 0 and X. To obtain a coordinate system (0,,.. ., 0, .,) for SYT! 
we proceed recursively, firstly representing points à € S!, the unit circle in #7, 
by a single coordinate 0,, 0 < 0, < 2n, determined by 


(2.5.5) À = (cos 0,, sin 0,). 


On S*~', the set of all points for which 4, = cos @y_, forms an (N — 2)- 
dimensional sphere of radius sin 0... ,, so that if we have a coordinate system 
(0,,...,04..,) for SNT? the addition of 6y_, automatically produces a co- 
ordinate system for S7 !. Hence, starting with (2.5.5), we obtain a coordinate 
system (0,,..., 0, .,) for S"! with the property that setting 6,_, = constant 
generates a sphere of dimension N — 2. This yields a representation, as noted 
above, for any à e A”. 

Now write G(A) for the function on [0, oo) defined by G(A) = fy; <1 GFA). 
Then, on substituting into (2.5.4) with t = (||t||,0,...,0) and performing the 
coordinate transformation, we obtain 


Rae = [| expGaltllos 6...) da) dG) 


where c is the surface area measure on S^ !. Integrating out 6,,...,0y—> it 
follows from what we have already noted that 


(256) R(ItI) = Ay f f exp(alltlicos x- Xsin 8y- 1)"7? dy, dG(4) 
0 0 


where Ay is the surface area of S". 
The inside integral can be evaluated in terms of Bessel functions to yield 


Jt - zyxXAltll) 
ad 2? . 


which establishes the theorem, on absorbing the constant Ay...) into G(4). 


f exp(iA||t|| cos 0) sin? 0 dé = 
0 


For small values of the dimension N the spectral representation of R(t) 
given by this result can be simplified even further. For example, substituting 
N = 2 into (2.5.3) yields that in this case we have 


(2.5.7) R) = f *To(Allt]) dGQ), 
0 


while substituting N = 3 into (2.5.6) and evaluating the inner integral easily 
yields that in this case 


? sin(Altl) 
5 = s (A). 
(2.5.8) R(t) = 2 f ije 900 


2.5 ISOTROPIC FIELDS 37 


In a large number of applications, random fields that are functions of ‘space’ 
and ‘time’ arise, so that the parameter set is most conveniently written as (t, x), 
t € ®', x € 4". Such processes are often homogeneous in (t, x) and isotropic in 
x in the sense that 


E(X(s, u)X(s + t, u + x)} = RG, lxl), 


where R is a function from 2? to C. In such a situation the methods of the 
previous proof suffice to show that if X is mean square continuous then we can 
write its covariance function in the form 


(2.59) RG xp = [| epia aco, 2 


2 (N-2y2 
Hy(x) = B F(Z va 


where 


and G is a multiple of a distribution function on the half-plane 
{(, v):0 <A < 0, —oo «v « oo. 


Given the simplicity of the spectral representation of the covariance function 
of an isotropic field, it is natural to seek a corresponding simplification of the 
spectral representation of the field itself. To do this foreach N < 1 we introduce 
a set of functions (hU), | = 1,..., dm, m = 0, 1,...} defined on the unit N-di- 
mensional sphere S" and known as the set of spherical harmonics on S", which 
form an orthogonal basis for the space of square integrable functions on S". 
(For details of these functions, see Erdélyi, 1953.) The sequence {d,,} is a fixed 
sequence of integers. Now use the spectral decomposition 


X(t) = Í exp(it - 4) dZ(A) 
AN 


to define a family of orthogonal increment random set functions defined on the 
intervals A c [0, oo) by the formula 


(2.5.10) ZA) = f f (0) dZ(A, 0) 


where we have converted to the parameterization of 2" developed in the proof 
of Theorem 2.5.3. From this family, define a family of mutually uncorrelated, 
stationary, one-dimensional processes {X mı} by 


? Jn tN ar 
(2.5.11) X mit) = f n DGD AZn) 


Then it is possible to establish the following result, a proof of which can be found 
in Wong (1971). 
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Theorem 2.5.4 


A zero-mean, mean square continuous, isotropic random field on 4" can be 
represented by 


o d 


(2.5.12) X() = X00) = YY eX ml) 


m=0 1-1 


for some sequence of positive constants c,,. 


In view of (2.5.11) and the form of the Zm, this result provides a spectral 
decomposition of an isotropic process. Furthermore, it indicates that such 
processes can be decomposed into a countable number of mutually uncorrelated 
stationary processes with a one-dimensional parameter, a result which one would 
not intuitively expect. 

Other representations of a specific class of isotropic fields have been obtained 
by Mittal (1976) and Berman (1978), generalizing the one-dimensional results 
of Berman (1975) and Mittal and Ylvisaker (1976). These fields have covariance 
functions that can be written in the form 


(2.5.13) Rose M 


€ Ju2 


[ sin 72g ax) f(x) dx, 


0 


where 


HEUS t 
c= f sin’ ?« da, 0 = arcos| — ], 
0 2x 


and f (x) is a density function such that f (x)/x" ^ ! is non-increasing. For these 
fields it can be shown that the 'distribution' function G in (2.5.3) has a cor- 
responding density and that the field itself can be represented as a stochastic 
integral of a Brownian sheet (cf. Section 8.9) over certain subsets of 2 *!. 
Some applications of these fields are given in Dalenius, Hájek, and Zubrzycki 
(1961). 


CHAPTER 3 


Sample Function Regularity 


fo 
31 SUITABLY REGULAR FUNCTIONS 


In Chapter 1 we commenced a preliminary study of some simple aspects of 
certain random sets generated by random fields. In one example we looked 
at the arc length of the random contour lines {(t,, t2) € S: X(t,, t2) = uj. 
Implicit throughout that analysis was the assumption that these contour 
lines were well-defined objects satisfying certain regularity properties. For 
example, it was necessary that, with probability one, they be continuously 
differentiable. 

In this chapter we shall make a careful and detailed study of regularity 
conditions for random field sample functions. To achieve this, we shall commence 
with the concept of 'suitable regularity defined below, which lists, for non- 
random functions, the properties we shall later often require to hold for random 
functions. We set about deriving sets of conditions on various aspects of the 
finite dimensional distributions of random fields which ensure that the sample 
paths of the fields possess these properties with probability one. The most 
common regularity property that we shall require is that of sample path 
continuity. We commence a treatment of this for general fields in Section 3.2, 
and in Sections 3.3 and 3.4 we give a fully detailed account for the Gaussian case. 
Since virtually every random field we shall consider in later chapters will be 
both Gaussian and sample path continuous with probability one, this case is of 
particular importance. 

In Chapter 4 we shall see that, at least for processes defined on the real line, 
the study of level crossing problems breaks up naturally into two disjoint parts. 
On the one hand we have the situation in which the number of level crossings in 
a finite interval is almost surely finite, while on the other this variate is infinite 
with probability one. A similar dichotomy exists in the study of random fields. 
In order to study the 'finite' situation it is necessary to impose a number of 
conditions on the sample paths of the fields. Indeed, what we shall later generally 
assume is that sample paths belong, with probability one, to the special class of 
functions we now define. 
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Definition 3.1.1 


Let F(t) be a non-random function from R” to 2?! possessing first- and second- 
order partial derivatives F; = OF[0t;, F;; = 0°F /dt; 0t, i, j = 1,2,..., N. Let 
S c & be a compact subset of A“ whose boundary 0S has zero Lebesgue measure. 
Then if for every permutation (ji, j2,- . ., jy) of (1, 2, ..., N) and a fixed real u the 
following four conditions are satisfied, we say that F is ‘suitably regular’ with 
respect to S at the level u. 


(3.1.1) F has continuous partial derivatives of up to second order in an open 
neighbourhood of S. 

(3.1.2) There are no points te S for which F(t) = u and F(t) =Q, j = 
bin N, 

(3.1.3) There are no points te 0S for which F(t) = u and F;(t) 20, v = 
12, .., N—1L 


(3.1.4) If D(O denotes the symmetric(N — 1) x (N — I) matrix with elements 
F5 (0. v, T= 1,...,N — 1 then there are no points t € S for which 
F(t) = u, F;(t) =0,v = L2,...,N — land det D(t) = 0. 


Although at first glance suitable regularity seems to demand a large number of 
extremely restrictive conditions, this is not actually the case. In two dimensions, 
for example, it does little more than ensure that the contour lines, 


{ti tre S: F(t,, t2) > u}, 


are smooth, twice differentiable, curves. In higher dimensions the corresponding 
conditions do little more than ensure that the level sets are smooth, (N — 1)- 
dimensional manifolds. 

In the following section we investigate conditions that generate random 
fields possessing sample paths that are suitably regular functions with probability 
one. In the reader’s interest we point out that now that the definition of suitable 
regularity has been introduced it is possible to proceed directly to the following 
chapter which starts the serious treatment of the matters of random field 
generated geometry that are the central concern of this book. The remainder of 
this chapter can be read at some later time. 


3.2 SUFFICIENT CONDITIONS FOR REGULARITY 


In this section we shall investigate conditions which guarantee that the sample 
paths of a random field satisfy the conditions of suitable regularity with proba- 
bility one. We shall, however, treat these conditions in two distinct groups. 
Condition (3.1.1) which entails sample function continuity and differentiability, 
is, of its very nature, distinetly different to the remaining three conditions which 
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essentially demand that the function and its derivatives (or some functions of 
them) do not take on certain values at the same point. We shall treat the latter 
class of conditions first, and so we shall commence by assuming for the moment 
that we are dealing with an X(t) that is a homogeneous N-dimensional random 
field possessing, with probability one, continuous partial derivatives of up to 
second order in an open neighbourhood of some compact set S of 2”. The set 
S will be considered fixed in all that follows. We shall obtain a set of conditions 
sufficient to guarantee that these assumptions hold later on. 

We remind the reader, for the last time, that we shall always assume that we 
are dealing with separable random fields. This assumption is crucial to many of 
the arguments we shall use in this section. In a,certain sense, we justify this 
assumption in the following section, where we shall establish the existence of 
separable, sample function continuous, Gaussian fields. 

The first aspect of suitable regularity that we shall consider is that given in 
(3.1.2), which guarantees that sample functions do not, almost surely, have 
critical points (i.e. points at which X (t) = 0 V j) at a fixed level u (which will 
remain constant throughout this section). The result we shall establish is a 
multiparameter analogue of a result due originally to Bulinskaya (1961) (see 
also Cramér and Leadbetter, 1967, pp. 75-76) that states that under reasonable 
conditions random processes on the line are never tangential to a given level 
in a finite time period. We shall establish the result for general homogeneous 
fields, without any Gaussian assumption. 


Theorem 3.2.1 


Suppose the probability density of the vector [X(t), X,(t),..., X.(O] is 
bounded by a finite constant K. Then for arbitrary real u there are almost surely 
no points t € S at which this vector-valued field equals [u, O, . . ., 0]. 


Before we can prove this result we require some preliminary notation and a 
lemma. Hence we commence by introducing the following notation for the 
moduli of continuity of X and its first- and second-order partial derivatives. In 
each case i,j = 1,2,..., N and the supremum is over t and s contained in the 
set S: 


@x(h)= sup |X ~ X(s)]. 


lt-sl| «^ 

(3.2.1) ch) = | sup |X ,(t) — X (9), 
it-sl| «^ 

eh) = sup |Xj;(t)— X;{s)|. 
lt-s|| «^ 


We can now state the lemma. 
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Lemma 3.2.1 


Given any £ > Q it is possible to find a continuous function w,(h) for which 
cex(h) | 0 as h | 0, and finite positive constant C,, such that P{E,} > 1 — e where 
E, is the event 


fmax sup| X (O| < C,, ox(h) < NhC,, max wh) < NhC,, 
ij teS j 
max w;;(h) x @,(h), forO <h< n 
i,j 
Proof 
For ease of'notation, set 


w*(h) = max oh). 
1<i,j<N 


Then, for every h > 0, w*(h) is a random variable, and since the X; j are almost 
surely continuous it is clear that 


limP(o*(h) < c) 21 

h^0 
for every fixed c > 0. Let {c,} bea real sequence, strictly decreasing to zero. Then 
for any e > 0 we can always find an A, > 0 such that 


P(o*(h) < e) > 1-2 5. 


Since w*(h) is non-increasing in h, it follows that 
2 "e 


Pío*(h «c,0«hzxh,j)-»1- 6 


We may assume that the h, also form a sequence decreasing to zero, and then 
we have 
2 "t 

6 


P(o*(h) <c,,0<h<h,n=1,2,..}>1- Y 
n-1 


21-2. 
6 


Defining w,(h) = c, for h,,, < h < h,, we have 


Pio*(h) « oh Ohh} >l e 
b 
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If h, < 1 simply set co, (h) = co for h, < h < 1, where co is large enough so that 
P(o*(h) < coh «h 1) 51 x 

Combining the last two inequalities yields 

(32.2) P| max «,(h) < ol >1 =e 


1<i,jsN 


This completes the first stage of the proof. We now note that the existence ofa 
finite C, for which 


P{max sup| X,t)] < cl dp 
ij teS 3 
follows from the fact that were there no such C, we would have 
P| max sup|X;()| = coh >is 0, 
ij tes 3 
which would contradict the continuity of the X;; over the compact set S. 


Furthermore, if for a given realization we have 
max sup|X;,(t)| < C,. 
nj teS 
it immediately follows from N applications of the mean value theorem that 
max sup |Xft)— X,(s)| < NhC,, 
Jj. dt-sll«h 
so that we also have 
(3.2.3) P| max sup|X,,(t)| < C,,max oh) < "ic. >1-— 5 
ij teS j 
Finally, applying the same arguments to the X ; and X, it follows that 
£ 


(3.2.4) Plox(h) < NhC} > 1-3 


as well, where we increase C, if necessary. Combining the inequalities (3.2.2) to 
(3.2.4) then establishes the lemma. 


Proof of Theorem 3.2.1 


We are now in a position to prove Theorem 3.2.1. From the compactness 
ofthe set S and the homogeneity of X it is clearly sufficient to prove the theorem 
when S is the unit cube in #, ie. 


S-L tte 0znzll-LH2...N). 


o 
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Let m — 1, 2,... and consider a sequence of decomposition of I, into m" cubes 
Imk» k = L..., m", where each I,,, has vertices of the form t = (k,/m,..., kn/m) 
and edges of length m` +. The k; are integral and 0 < k; < m. Write t,, for the 
midpoint of I,,,, i.e. the point which is equidistant to the 2" vertices of Imp- 
Furthermore, define 4,4, 1 < k < m", m > 1, to be the event that there exists 
ateI,,, at which X(t) = uand X (t) = 0,j = 1,2,..., N, while A represents the 
same event for t € /,, ie. A = A,,. We thus wish to show P{A} = 0. Since 


Ae Ase for any m, 
k=1 
we have, for arbitrary £ > 0, 
(3.2.5) P{A} < Y P(A,, ^ E3 + PEE 
k=1 


where E, is the complement of the set E, defined in the previous lemma. We 
shall obtain an upper bound for the above summation. Suppose that both E, 
and Amg have occurred, so that, at some t € Imk, 


X -u = X(0- -= XN) = 0. 
Then the bounds on cy and the œ; inherent in E, imply that 
|X(t,j) — ul € N??m"!C, 
and 
[Xjt,)] € NP7m^!C,, — forj =1,...,N. 
Thus, under the conditions of the theorem, 
P(A,, ^ E} € P{|X(tm) — u| < N?7m^!C,, | X (tm) | 
< Ng Caj = 1,...,N} 


< KN?!2m- 1 C (N??2m- 1C)" 
= KCP'* INSUN * D/2, - Wt 1) 


Substituting this into (3.2.5) and using the fact that P(E,) > 1 — e, we obtain 
P(A) < m (KCN+ INOW * 02) +g, 


Choosing first e small enough and then m large enough, we can make the above 
probability arbitrary small. This completes the proof of the theorem. 


We now turn to that aspect of suitable regularity covered by condition 
(3.2.4). Since the proof of the following result is similar to that of the preceding, 
Wc trcat it out of turn. 
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Theorem 3.2.2 
Suppose that the joint probability density of the vector 
[X(0, X 1(0,..., X. -.(0] 


is bounded by a finite constant K and that for any real numbers x,,... , xy. , and 
fixed u the following non-degeneracy condition is satisfied: 


(32.6) P(det D() = 0|X(t) = u, X( = xp j = L2,...,N — 1} 20, 


where D(t) represents the (N — 1) x (N — 1) matrix with elements X; (t), 
i, j21,2,...,N — 1. Furthermore, suppose that the conditional probability 
(3.2.6) is continuous in the values of det D(t). Then, with probability one, there 
are no points t € S at which this vector-valued variable equals (u, 0, . .., 0] and 
det D (t) = 0. 


Proof 


The proof follows that of Belyaev (1972a, Theorem 3.1). Again we need only 
consider the case S = I,. Retaining the general notation of the previous proof, 
let Bm be the event that there exists a t € I„ą at which X(t) = u, X (t) = 0, 
j=1, 2,....N — 1, and det D(t) 2 0. Furthermore, let B = B,, be the 
analogous event for t € I,. Then we must shown P{B} = 0. Since 


B = J B mk 
k=1 
we have, as before, 
(3.2.7) P{B} < » P(B,, © E,} + PEE. 
k=1 


Let te Im and write X;(t,.,) as X;,(t) XX ift.) — X y6]. Then it is easy to 
see that when weexpand det D (t,,) we obtain det D(t) plus(N — 1)!(2^^! — 1) 
extra terms. If the event E, has occurred each of these is bounded in modulus 


by an expression of the form C^ 1 To, G/ N m )| for some r between 1 and 
N — 1. If Bm has also occurred, it then follows that 


Idet D(t4)| < Q'^' — DIN — DIC 20,0 /Nm !) 
and also that 
IX,)| € N??m^!C,,  forj=1,...,N — 1, 
and 
IX(J44) — u| € N??2m-1C,. 


The last two inequalities are easily obtained using the bounds on c, and the œ; 
implied by F,. 
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Using now the homogeneity of X(t) we have 


(3.28) P(B, n E,} < f P{|det D(t)|< (NY? C o / N m^) 


|X(t) = xj x (x) dx, 


where we use X(t) to denote the vector-valued variable [X(t), X ,(t),..., Xy_1(0] 
and (x) is the probability density of X(t). The integral in (3.2.8) is over 
(u — N??m7'C,,u + N??m^1C,) for X and (—N?/?2m7'!C,, N??7m ^! C.) for 
the X;. Since #(x) is bounded under the conditions of the lemma and the 
conditional probability in (3.2.8) is bounded by unity and tends to zero as 
m — oo for any x, it follows from the dominated convergence theorem that for 
any ô> 0 and large enough m, P(B,, © E,} < óm ^". Combining this with 
(3.2.7) and the fact that P{E,} > 1 — e gives us P{B} < 6 + e. Since ô and e were 
arbitrary this fact completes the proof of the theorem. 


Finally, we obtain a result related to condition (3.1.3) of suitable regularity. 


Theorem 3.2.3 


If X satisfies the conditions of Theorem 3.2.1 and if the boundary of S has 
Lebesgue measure zero then, with probability one, there are no points t e 0S at 
which X(t) = u and X (t) = 0,j = 1,2,...,N — 1. 


Proof 


Firstly, we note that since the boundary of S, dS, has Lebesgue measure zero 
we can find a sequence of collections of cubes I,,,, each having edges of length 
m+, which cover ôS. Furthermore, if M(m) denotes the number of such cubes 
in the mth collection, the cubes can be chosen in such a way that m^" M(m) > 0 
as m > oo. Then if we use C,, to denote the event that there exists a te In, at 
which X(t) = u and X (t) = 0,j = 1, 2,..., N — 1, and C to denote the ana- 
logous event for t e 0S we have, as usual, 


M(m) 
P(C) < Y, P(C,, ^ E,} + P{E,}. 
k=1 
The same arguments as used in the proof to Theorem 3.2.1 lead to 


PIC ma n E,] sS P{| X (tm) = u| < N*2m ES. | X (t) 
< NY?m CLIP 02,.., N ~ 1} 
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where again t,,, denotes the midpoint of I „ą. Combining the last two inequalities 
with the conditions of the theorem gives us 


P(C) < M(m)KN??m !C(N??m ON! + e 
= m "M(m) (KCPN?N?) + e 


Choosing firstly ¢ small enough and then m large enough completes the proof. 


The results of the previous three theorems can now be combined in a straight- 
forward manner to give a set of general conditions for the suitable regularity 
of a random field, viz.: 


Theorem 3.2.4 


Let X(t), te 2“, be a homogeneous random field possessing almost surely 
continuous partíal derivatives of up to second order in an open neighbourhood of 
any arbitrary compact set S c A~ whose boundary has Lebesgue measure zero. 
Let P = (ji, j2, ..., jy) be a permutation of (1, 2, ..., N). Write Dp(t) for the 
(N — 1) x (N — 1) matrix with elements X, ; (t), v, n = 1, 2,..., N — 1. Then 
X is a suitably regular field if the following three conditions are satisfied: 


(a) The joint probability density of [X(t), X ,(,..., X (t)] is bounded by a 
finite constant. 

(b) The joint probability density of [ X(t), X ;,(, ..., X;, ,(t)] is bounded by 
à finite constant for any permutation P= (ji, j;, ... jq). 

(c) For arbitrary real numbers x, x5, .. - , Xy — 1, fixed u, and every P, 


P{det D,(t) = 0|X(t) = u, X; (Ù = x,, v = 1,2,...,N — 1) 20, 


and this conditional probability is continuous in values of det D, (t). 


This result actually gives us quite a neat summary of the conditions on uni- 
variate densities we need to assume for the suitable regularity of sample paths. 
The only condition that is not readily verifiable from the univariate densities 
is that of sample path continuity and differentiability. We shall treat this in 
detail for Gaussian fields in the following section. However, for the sake of 
completeness we note here the following result, due to Belyaev (1972a), which 
gives bounds in probability on the modulus of continuity for a general field, 
from which we shall obtain sufficient conditions for sample path continuity, 
differentiability, etc. Since our main interest is in the sharper Gaussian results of 
the following section, we shall neither give a proof of Belyaev's result nor 
consider its implications in much depth. (See, however, Kozaéenko and 
Jadrenko, 1976a, 1976b, for a more detailed analysis of this type of result.) 
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Theorem 3.2.5 
Let X(t) t € A“ be a real-valued random field, and suppose that 


(3.2.9) P(|X(t + h) — X(0| > c(h} x g(h) 

for all h for which |h| « h, where the functions e(h) and g(h) satisfy 

(3.2.10) y, 282 "7g(27?") < oo, y, 272") < oo. 
m=1 m=1 


Then for the function 


oo 


yh) = Y 2v"''gQ 7^, 
m=k(h) 

where k(h) is the unique integer for which 27>*%*) < h < 2-?"9 we have, as 

h|9, 


(3.2.11) P(oxy(h) > «(h)) < Wh). 


A simple application of the Borel-Cantelli lemma, as in the following proof, 
can be made to obtain simple conditions sufficient to ensure sample function 
continuity with probability one. Note, however, that (3.2.11) can give us much 
more information than mere sample function continuity. This type of informa- 
tion will be important later on. The following corollary provides useful sufficient 
conditions for continuity. 


Corollary 
If for « > Q and y > x we have 
C||hj?" 
3.2.12) E|X(t + h) — X(t) x ——__ 
) logih" 


then the random field X(t) will be continuous with probability one over any compact 
set S c gi. 


Proof 
Note first that, by the Markov inequality (1.3.1), 


E(|X(t — h) — X(0/5 
|&CI hl) [* 


(3.2.13) P(|X(t + b) — X(®| > «hl < 


Then set 
ah) flog | fl]? 
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for some 1 < $ < n/a, and 


C h 2N 
gh) ^| 


— lleginI] *" - |eQ |" 


Then 


Y «27? = Y [log 27?" 
1 m-i 


= |log 2|7 Y, 27", 
m=1 


which is finite, since fj > 1. Furthermore, 


o0 o0 2N2" AC 2-2N2" 
N2mt*1 -2m, _ 
RU 7497 = 2 fog 2-7m flog 


m=1 


= C|log 2 Pacey 5 2-m( +y- Ba) 
m=1 


which is finite since 0 < fæ < 4. It is easy to check from (3.2.13) that this choice 
of £ and g satisfies (3.2.9), so that all the conditions of the theorem are fulfilled. 
Furthermore, for this and g we have 

oo 2N2* *1 5, 4- 2N2* 


W= X 


ix [log 2:77 ]** ^s [log 2-7 [P5 


L C|log 2jfe"71 y 2- ka +n fa) 
k 


=m 


= K.2-mü*n-82) 
for some positive, finite, K. Hence (3.2.10) implies that for m large enough 
(3.2.14) Píoy(2^?") > |log 2| 72^ "^1 < K.2- n tn” Ba), 


But since the right-hand side of this inequality is a term of a summable sequence 
a simple application of the Borel- Cantelli lemma immediately establishes the 
sample function continuity of X, as required. 


Theorem 3.2.5 can also be used to obtain results for the partial derivatives of 
X(t). For example, if we set x = 2 in condition (3.2.12) it becomes 


C||hi?" 


R(t + h, t + h) — R(t + h, t) — R(t, t + h) + Rit) € — t — 
lad i ass [log hl] [.** 


where q > 2 and R is the covariance function of X. If we now assume X has a 
first-order mean square derivative X(t), then its covariance function will be 
R*(s, t) = 0? R(s, t)/0s, ĉt;, and we can use the above condition, with R replaced 
by R*, to determine if X (t) also has continuous sample paths and to obtain a 
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bound, in probability, on its modulus of continuity. Higher-order derivatives 
can be investigated in an analogous fashion. 

Finally, we note that results similar to the above can be found in the paper of 
Yadrenko (1971a), who obtains results sharper than (3.2.11) for guaranteeing 
sample function continuity. His results, however, do not explicitly contain 
bounds of the form (3.2.10). 
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If we now specialize to Gaussian random fields, we would hope that the 
conditions of Theorem 3.2.4 on the finite dimensional distributions of the field 
and its various partial derivatives that guarantee the suitable regularity of its 
sample functions could be written in a simple, and weak, form. Fortunately, 
this hope can be fulfilled. As we have already noted, if a random field X(t) is 
Gaussian then it immediately follows that the partial derivatives X(t) and 
X ít), if they exist in the mean square sense, are also Gaussian fields. Further- 
more, the joint distributions of all these processes are multivariate Gaussian. 
Consequently, from mere inspection of the multivariate Gaussian density (1.6.3) 
and the conditions of Theorem 3.24, it is clear that a Gaussian field is suitably 
regular as long as its various partial derivatives are continuous with probability 
one and the K = (N + 1)(N + 2/2 random variables X(t), X,(t), X;Kt) 
1 <i,j < N, havea non-degenerate joint distribution; i.e. the K x K variance- 
covariance matrix of these variates has a non-zero determinant. 

Hence the question of suitable regularity for Gaussian fields hinges almost 
exclusively on the question of sample function continuity and differentiability. 
It is possible to use the results of Theorem 3.2.5 to investigate this problem, but 
sharper results can be obtained by using arguments specifically constructed 
for Gaussian processes. We shall adopt the latter approach and investigate the 
sample path properties of Gaussian fields by first obtaining a representation of 
such a process through a type of eigenfunction expansion. This expansion, 
generally referred to as a Karhunen—Loéve expansion, will be the central con- 
cern of this section. This representation will not only eventually yield the 
information we are currently seeking but, as well as being of intrinsic interest, 
will also be of use to us in the future in providing proofs of other results. (For 
example, one of the results implied on the way to developing the expansion is 
that the continuous Gaussian fields we have been talking about so freely do in 
fact exist and, furthermore, are separable; see Theorem 3.3.2.) Before we can 
state this result, however, we need to set down some terminology and notation. 

Let T be a compact interval in Z” and suppose that on T x T we have a 
continuous, real-valued, non-negative definite (covariance) function R(s, t). 
Consider the integral equation 


(3.3.1) f R(s, Od(t) dt = Ad(s), forse T. 
J 
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A nonzero number 4 for which there exists a function $ satisfying both (3.3.1) 
and the integrability condition fy | @(t)|? dt < oo is called an eigenvalue of the 
integral equation. The corresponding ¢ is called an eigenfunction. Any eigenvalue 
of (3.3.1) must be real and positive, and in general such an equation will yield 
an infinite number of eigenvalues 4,, A,,... with corresponding eigenfunctions 
Qi» $5..... We can assume that the eigenvalue sequence is non-increasing and 
that eigenfunctions form an orthonormal sequence, in the sense that 


1 for i = j, 


(3.3.2) [ e109. dt = i for i z j. 


A fundamental result in the theory of integral equations is the following, 
known as Mercer’s theorem. For further discussion of this result see Riesz and 
Sz-Nagy (1955, p. 245), which gives a detailed discussion for the one-dimensional 
case, or Zaanen (1956, p. 534), who treats the N-dimensional case. 


Theorem 3.3.1 


Let R(s,t) be a continuous, non-negative definite function on the compact 
interval T x T c 9?", with eigenvalues A, and eigenfunctions ¢ ; satisfying (3.3.1) 
and (3.3.2). Then 


(3.3.3) R(s, t) = DA; OS) O,0, 
j= 
where the series converges absolutely and uniformly on T x T. 


The fact that the convergence in (3.3.3) is uniform is a strong result and 
immediately implies 


(3.3.4) lim ff |R(s, t) — Y 46960 ds dt — 0, 
moo j=l 


TXT 


a fact we shall use later. 

We are now in a position to commence setting up a Karhunen-Loéve 
expansion. For notational convenience, set T = I,, the unit cube in 2", and let 
R(s, t) be continuous and non-negative definite on I, x I, with the Mercer 
expansion R(s, t) = $7, 49 (s)ó (t). Furthermore, set 


(3.3.5) 48,1) = Y Apo) - 6401 


= R(s,s) + R(t, t) — 2R(s, t) 
and 


(3.3.6) pio = maxío(s, t): Is — tll < lul / N) 
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Finally, let {9(@)} be a sequence of independent standard Gaussian variates 
and for m > 1 set 


(3.3.7) Xt, o) = È V/A, 6 (00 0). 


while Xt) = 0 for all t. Then the following result, which is a slight variation 
on a result of Garsia (1972), holds. 


Theorem 3.3.2 
Suppose 


(3.3.8) [ctos u)!? dp(u) < co. 


Then, with probability one, the partial sums X(t) converge uniformly on I, c BN 
and, furthermore, are almost surely equicontinuous on 1,. More precisely, the 
following is true for all m: 


Q39)  |X*(s) — XW] < 16,/N [log B]'?p(Ils — tll) 
lis €i 


+ 16,/2N [ (—log u)!? dp(u), 


where 
Xs, w) — X(t, c) 
B(o) — sup i 2 ds d 
E NE PLAN pills — tA IN) 
and, furthermore, 
(3.3.10) E| Bœ)" < (44/2). 


To prove this result, which is the main result needed to set up a Karhunen- 
Loève expansion, we shall follow Garsia’s original argument, for which we 
require two preliminary lemmas. The first is a martingale inequality and the 
second a lemma related to continuity properties of real, non-random, functions. 
To state the first result, let X,, X2,... be a sequence of non-negative random 
variables satisfying 


(3.3.11) E{|X,,|} < oo, for m = 1,2,.... 
E{Xm+1| Xm, Xo po X1} € Xm with probability one, for m = 1,2,.... 


Such a sequence is called a submartingale. The following results are classical 
and readily available in virtually any book that incorporates martingale theory 
(see, for example, Doob, 1953, p. 317). 
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Lemma 3.3.1 


Let X,, X2,- .. bea submartingale sequence of non-negative variables. Then if 
lim, , 4, E(| Xml} =K < oo there exists arandom variable X „ such that X m> X « 
with probability one and E{|X |) = 


Lemma 3.3.2 


For a non-negative sequence of random variables satisfying (3.3.11) the following 
inequality holds for any a > 1 and any m > 1: 


a a 
zf sup xar} < 5) E(|X,I5. 
1xkzm x-1 
To state the next result we assume we have two functions p(u) and y(u), 
defined on [— 1, 1] and (— oo, oo), respectively, satisfying 
(3.3.12) plu) = p(—u) | 0 asu | 0, 
(3.3.13) y(u) = v(—u) 1 oo as |u| f oo. 


Furthermore, we shall assume that y is convex. 

If we now call an interval J c 2 a hypercube in 4" if it is of the form I = 
TIX, [a;. b;] and b; — a; = e(1) is the common length of its edges, we have the 
following lemma. 


Lemma 3.3.3 


With the above assumptions, let f (t), te 1, C 4?" be a continuous real-valued 
function for which 


f .- f 
(3.3.14) AE (5° we) eser 5 


for all hypercubes I c I,. Then 


lis~¢]| 


(3.3.15) |f) — f(D <8 f Y- (Bu-2%) dpl), 


for alls, t € 1,. 


Proof 


Lets, t be two points in 1, and let Qo be the smallest hypercube containing both 
of them. Furthermore, let {Qm} be a sequence of hypercubes for which 


(3.3.16) Om © Om—1> form = 1,2,..., 
(3.3.17) pe) — ip(e (Qu 1) for m = 1,2,.... 
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We commence by noting that from (3.3.14) and the convexity of v and Jensen's 
inequality (Lemma 1.3.6) it follows that 


y [ fom — fon) XO. -1 (o = 5) 
( Meo y) AG Aa J | os) 
< B[4(Q4)A(Q,, - D] ~ 
where, for any hypercube Q c & we set fy = [4(Q)] ' fo f(t) dt. Inverting V 
in the above inequality gives 
(3.3.18) | fon — fo... € P(e(Qn- Y T (BEA(Q,)A(Qm -:)] 7). 
Setting e,, = e(Q,,) it follows from (3.3.17) that 
P(CCOm-1)) = 4[P(Em) — Plem+1)]- 


Noting u€[e,,,,e,] implies u^?" > 4(Q,)4(Q,,.,) and substituting into 
(3.3.18) leads to 


fon — fon <4 Í 7 V -1(u-2*) dp(u). 


em*i 


Summing this inequality for m = 1, 2,... yields 


eQ) 
lim sup | fom — fol < 4 Í Y= (Bu7?") dp(u). 
0 


m-?oo 


If we also ensure that the sequence of sets {Q,,} are chosen to decrease to the point 
s, the continuity of f and the last inequality gives us that 


Ils - ti 
ISO- falsa — Vou dpo). 
0 
Since the same inequality must hold with s replaced by t the lemma is proven. 


Proof of Theorem 3.3.2 


We now have the necessary tools with which to develop a proof of Theorem 
3.3.2. To commence this we first fix sand t in J, and define a sequence of random 
variables {Pm} by 


Pa | Xs) — X(t) 
"=OP 2 [3N ps — th//N) 


where X(t) is, of course, the partial sum defined by (3.3.7). 
If we now set 


2 
| form = 1, 2,..., 


—O xem) — X(t) 
padis = tN) ` 
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then Q,, is easily seen to have a zero-mean Gaussian distribution with variance, 
62 say, less than one. Thus 


sen = rni 


c2 — 1/2 
--i . 
(1-33) 


The second equality can be obtained either by straightforward integration or 
using the fact that Q2/o2 is a chi-squared variate. Note that since o2, < 1 we 


now have 
1 -1/2 
ERGA 
E{Pn} < ( x) 


2N 1/2 
= Gres? : for all m z 1. 
Furthermore, from the definition of X? we have 
Pari = P, exp(R,, 1) 
where 


Àm 18 ei DO e CS E Pm+ (t)]* 
+ /Am+19m+ LP m+ 1S) ~ Pmt OILS) — XO] 


[2,/N pls — tl/A/ N)P 


Since conditioning on P,,,..., P, is the same as conditioning on 6,,,..., 4, 
we have 


ub 
Rosi E 


E{P nat |P,. EXE P,} = P,E{exp(Rn+1)|8m> eg 0j 
< P, exp(E{Rm+1 | Orns o oy 013). 
where the inequality follows from a conditional version of Jensen's inequality 
(Lemma 1.3.6). But the expectation here is clearly positive, yielding 
E(P, Lad PSU usus Pi} < Pm: 


That is, {Pm} is a submartingale. 
Hence it immediately follows from Lemma 3.3.2 (with x = 2) that, for each s 
and t in h, 


1/2 
Ef mas P3) < 4E(P2) < en j) 


ksm 


Integrating over J, x I, and using Fubini's theorem gives us 
X'9(s) — | Xt) 2N F 


2 
E Í max ex o| D we A ds dt < E 
feel kem 2 /N pls — tll /N) 2N — | 
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Upon letting m f oo in this inequality and using Lemma 3.3.1 and the monotone 
convergence theorem we obtain the moment inequality 


2N \1? 
2 ONER 
E|B(o)| «x ~- z) l 


This result implies the almost sure finiteness of the variable B(œ) so that we 
now have, with probability one, for m = 1, 2,..., 


X™s) — X (t) | 
ds d : 
NECS = tN) s dt < B(o) < oo 


Lemma 3.3.3 can now be applied in an almost sure sense to yield that with 
probability one 


Ils - €]l 


(3.3.19) |X(s œ) — Xt, o)| < 16/N | (logLB(o)u -?*])!? dp(u), 


o 


whichis precisely (3.3.9)with the left-hand sideoftheequation slightly rearranged. 
Finally, note that since the ġ, are orthonormal 


Í K Je E^ di i 


o 


Ee iS yu so] ds, 


o 


which tends to zero as m — oo according to (3.3.4). Hence 


2 Y nozol = y A, = Í R(s, s) ds < oo 


o 


implying that, for almost all o, $2. , 4,02(o) is finite. Thus, for m > m', 


| X(t, o) — Xt, w)? dt = Yi 4,02(0) 
kaw 


Io 

> Qas. as m, m oo. 
This implies the almost sure uniform convergence of X(t, œw) in I,, for we now 
have (from 3.3.19) that the X“ are almost surely equicontinuous and the above 


relationship establishes their almost sure mean square convergence. 
This virtually completes the proof of the theorem. All that remains is to 


establish the moment inequality E|B(w)|* < (A /2y*. To do this we first note 
X (s?) — x eee) 


that 
2 
2,/N p(Is'? — «9| / N) | 
x ds! ds dt” -+ - ae) 


N 


(3.3.20) E|B(o)|" < a Í sup exp| M, 
INI m 


k=1 
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Interchanging the order of integration in this expression and using Lemma 1.3.5 
(3.3.20) yields 


N XAH — X(t 
E|B(o) IN « Í f Len exp| S ) - (t | ds? de®. 
mmasi m — L2p(iis® — (lA /N) 


Applying again the submartingale arguments used above, it follows as before 
that 


E|B(o)l" < (44/2y*. 
This completes the proof of Theorem 3.3.2. 


One immediate implication of this result is that if we have a continuous, 
non-negative definite function R(s,t) on J, x I, which we use to define a 
function p(u) via (3.3.6) which satisfies (3.3.8), then a separable Gaussian process 
with almost surely continuous sample paths and this covariance function exists. 
In fact, the process can be defined as the limit of the sequence X(t) defined 
by (3.3.7). 

We now turn to the main result we are seeking, the Karhunen-Loeve ex- 
pansion of a Gaussian process, which also generates sufficient conditions for 
sample path continuity. The result is as follows. 


Theorem 3.3.3 


Let X(t), te I, c 4", be a real-valued, zero-mean, Gaussian random field 
with continuous covariance function R(s, t) which has the Mercer expansion 


RGD = 2.26900. 
Define, by a mean square integral, the sequence of random variables 
(3.3.21) 8o) = A; !? Í X(t, œ$ (t) dt, for j = 1,2,.... 
Then if the function p(u) defined by (3.3.6) satisfies 
(3.3.22) f iog u)!? dp(u) < oo 
the sample of functions of X(t) are continuous on I, with probability one and 


XO) = Y Aj?6(00(0) > X(t,o) ^ asm oc 
jot 
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uniformly for te I, with probability one. Furthermore, if this holds, then with 
probability one the modulus of continuity w(6) of the sample functions can be 
majorized as in the following expressions, in which B(w) is a positive random 
variable for which E|B|N « (4/2)" : 


(3.3.23) a (5) € 164/N [log B]'?p(0) + 16,/2N - Í a u)!? dp(u). 
0 


Proof 


Given Theorem 3.3.2 and the (implicitly assumed) separability of X the 
results of this theorem will follow easily once we can show that the sequence 
(8) defined by (3.3.21) is one of zero-mean, unit-variance, independent Gaussian 
variates. We really need only check variance and independence: 


E(8,8) = Ely! [ | xosexwsso as ath 
IY Io 


= (hà)! I | R(s, 06.) (t ds dt, 


on interchanging the order of integration. Now use the eigenfunction property 
(3.3.1) of ġ; to obtain 


À; 1/2 
Ete) = (2) [ otos 


Since the $; are orthonormal (cf. 3.3.2) the above integral is zero if i # j and one 
otherwise. Hence the 0; are independent and of unit variance. 


This completes our development of the Karhunen-Loeve expansion for 
Gaussian fields, and effectively concludes our discussion of suitable regularity 
in the Gaussian case. Condition (3.3.22) is, however, often unwieldy, so in the 
following section we shall denote a little time to using (3.3.22) to obtain simpler 
(although generally stronger) conditions sufficient for almost sure sample 
function continuity of Gaussian fields. 

Before moving on, however, it is worthwhile to note that the study of con- 
tinuity properties of Gaussian processes is much wider than we have indicated 
here. For example, it is possible to show that (3.3.22) 1s necessary, as well as 
sufficient, for sample function continuity. An excellent review of this subject 
is given by Dudley (1973) with some more recent results available in Fernique 
(1975). Dudley's paper studies far more general processes than those we have 
considered, including, for example, set indexed fields and the notion of metric 
entropy, as well as providing an exhaustive list of references. 
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34 SIMPLE CONDITIONS FOR CONTINUITY IN THE 
GAUSSIAN CASE 


Let us commence by recalling Theorem 3.3.2, which, among other things, 
states that a real-valued Gaussian field with a continuous covariance function 
will have continuous sample paths on I, c 4" with probability one if 


(3.4.1) [ tes u)'!? dp(u) < oo, 
0 

where 

(3.4.2) pu)- max [E|X(s) — X(0P1!. 
ls — tl] < lul /N 


For what will follow it is often more convenient to write (3.4.1) in a slightly 
different form, which is given in the following lemma. 


Lemma 3.4.1 
If p(u) is defined by (3.4.2) then, for any M > 0, 
1 oo 
(3.4.3) Í (—log u)!? dp(u) < oo <> Í p(e^*) dx < oo. 
0 M 
Proof 


Let M > 0 be fixed, and set 7 = exp( — M?). Note that 0 < y < 1 and that 
from the monotonicity and boundedness of p(u) on [0, 1] it follows that 


Í (—log u)!? dp(u) < oo <> [toe u)!? dp(u) < oo. 
o 0 


Using integration by parts we have 
"n "n 
J (og 9"? apt) = -log i ^po)3t + 3 [tog 7 uc tp) du, 
0 


so that the integral on the left will be finite if and only if the rightmost integral is 
finite. However, upon making the substitution x = (—log wu), this integral is 
easily seen to be equivalent to f} p(e~*’) dx, which completes the proof of the 
lemma. 


We shall now place a more specific growth rate restriction on the function p(u) 
than that inherent in (3.4.1), which we shall show not only, a fortiori, guarantees 
sample function continuity but also yields useful and tractable information on 
the modulus of continuity. The following result, insofar as it deals with sample 
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continuity, is due originally to Belyaev (1961), who treated stationary one- 
dimensional processes, and Dudley (1965) and Belyaev (1972a), who treated 
random fields. A probabilistic bound on the modulus of continuity similar to 
(3.4.5) can also be found in Belyaev (1972a) who claims to have obtained his 
result, without giving an explicit derivation, from the general Theorem 3.2.5. 
The technique we use below is easily adaptable to check the existence of bounds 
like (3.4.5) for any specific covariance function. 


Theorem 3.4.1 


Let X(t), te 2", be a real-valued, zero-mean, Gaussian random field with a 
continuous covariance function. T hen if, for some 0 < C < oo and some £ > 0, 


C 


E REPRE 
(3.4.4) E| X(s) X(t)| < |loglis _ tl pte 


for all s, t e Ij, X has, with probability one, continuous sample functions over I,. 
Furthermore, defining the modulus of continuity w y(n) as in (3.2.1) gives, for any 
a> 0, 


(3.4.5) n "NPioy(g) > a} 3 0 as y > 0. 


Proof 


To establish continuity we need only, in view of Theorem 3.3.2 and Lemma 
3.4.1, show that f% p(e~*’) dx < oo for some M > 0. But, by (3.4.4), if |u| < 


1//N, 


[o 
3.4.6 2(u) = E|X(s) — X(0? < —————, 
(3.4.6) p'(u) HEU | X(s) (t)| logu /N)] ** 


so that if we take M = (1 + log,/N)!? we have 


ci? Í llog / Ne-* || a *92 dx 
M 


IA 


[e ?) dx 


MT Í Ix? — log /N | *? dx 
( 


1+ log /N)! 2 


which is clearly finite. Thus continuity is established. To complete the proof we 
need only obtain the inequality (3.4.5). To do this we note that by Theorem 3.3.3 
the modulus o(1) is bounded, with probability one, by 


- n 
(34.7) I6 /N[log B]!2p(q) + 16,/2N Í (—log 1)? dp(u) 
0 
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where B is a random variable satisfying E| B|" < (4,/2)" . Integrating by parts 
we obtain 


(3.4.8) 
[tee 1)? dp(u) = [(— log u) "p(u)]h + 4 Í "(logn Pu" 1 pg) du 
0 


= [Ilog ul! 2p(u)]3 + [re dx 


where now M = |log 5|!/?. If we now assume that y € (0, N^!/2) we can apply 
(3.4.4) and (3.4.6) to a combination of (3.4.7) and (3.4.8) to obtain that wy(y) is 
almost surely bounded above by 


16, /N[log B]'^C'? [logl,/N || +°? 
* 16QC) ^N log v? oglo NI 72 + f Ix tog yN 10 ax} 
M 


If we evaluate the integral and (by dropping terms in log J/N ) take some simple, 
but rough, bounds for some of the terms, we obtain 


@y(n) < K;[log B]'?|log j| "+? + K,|log n7? a.s., 


where K, and K, are finite constants. But then, setting f = |log 5|, so that 
f > œ as y > 0, we have 


" OL -K p? 2 
n "Ptoy(gp > a} < PB > exp (“iar | . 


Applying Markov's inequality and the fact that E(|BIV) < (4./2)" to this 
inequality yields 
(34.9) 


q^ P(oy(n]) > o) < (4,/2)" exp] -N| 2 + (& para — = "y 


which clearly tends to zero as ņ > 0 (f — oo) for any « > 0. This establishes 
(3.4.5) and thus the theorem. 


In both the discussions of the current and preceding sections we have not 
made an assumption of homogeneity of the field X. It is natural to expect that 
in such a case the conditions we have encountered become somewhat simpler, 
and, indeed, this is the case. Consider firstly the function p(u), for which we have 


p(u- max E|X(s) — X(t)? 
IIs ~ t|] < Iul /N 

= max E|X(0- X0)? 
Ht ll < lul N 


max 2[R(0) — R(t] 


ell < ju]. N 
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where, as usual, R(t) is the covariance function of the field. This result leads to the 
following trivial consequence of the preceding theorem. 


Corollary 


If X(t) satisfies the conditions of Theorem 3.4.1 then condition 3.4.4 can be 
replaced by 


C 
R(0) — R(t) < —————. 
©- RO < Togi 


and the theorem continues to hold. 


Another question that arises naturally in the case of a homogeneous process 
is whether or not continuity properties for Gaussian fields can be read off from 
the spectral distribution function rather than from the covariance function, 
as we have been doing up until now. In principle there is no difficulty here. 
Given a spectral distribution function one can, in view of the spectral representa- 
tion theorem (Theorem 2.1.2), always compute the corresponding covariance 
function and use it to check for continuity. Nevertheless, it is sometimes useful 
to have an explicit result stated in terms of the spectrum, and we shall derive 
such a result now. The following theorem is due, for N = 1, to Marcus (1973). 
Watanabe (1973) has a similar, though slightly weaker, result for N-dimensional 
fields. 


Theorem 3.4.2 
Let X(t), te Z^, be a real-valued, zero-mean, homogeneous, Gaussian random 
field with continuous covariance function 
R(t) = Í cos( - t) dF(A). 
aN 


For real positive, A, and m > 1, set 


F*(4) = Í dF(A) FR = 1 — F*(2"). 


lA |] «4 


Then if 


pu)= max 2[R(0) — R(0] 
Nel] < juvy 


we have 


o0 1 
(3.4.10) Y (m FR"? o (—log 10! ? dp(u) < ©. 
m 1 0 
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Although (3.4.10) is known to be the best possible result of this form (Marcus 
and Shepp, 1972) the summation in (3.4.10) is not always easy to evaluate. 
Thus we shall soon give a weaker condition on the spectrum that ensures the 
convergence of this sum, yet is somewhat simpler to check. In order to prove 
Theorem 3.4.2 we require the following lemma, stated in Hardy, Littlewood, and 
Polya (1934, Theorem 3.4.5) and due originally to Copson (1928). We omit the 
proof. 

Lemma 3.4.2 
If ay, a5,... is a sequence of positive numbers and p € (0, 1) then 
oo H oo p 
Š ar < p5 È (m is. 


Proof of Theorem 3.4.2 


The proof of (3.4.10) is based on that used in Marcus (1973) to establish the 
result for one dimension. We commence by noting that, for any t e A”, 


RO) — R(t) = NE — cos(t - 3)] dF) 


=2 sin? Gt - X) dF(A) + 2 Y sin?(3t - X) dF(A) 
j70 vA; 


All «1 
where A, is the annulus 
= {RE AN X < lal) < 244. 


If we now note that |t - | < Ilt] - |A|| and |sin x| < |x|, we immediately have 
for any t for which 27 ("*? < |t|| < 27"* that 


RO) - RO < Hil? 3. I, PIP AFA) + Y [ aro) 


j=m+1 «Aj 


1 m 28 
Sy 8, 2 3m FF — FR) + F41- 


For 27 +2) < y « 27™*DY define 


m 925 
(3.4.11) $?(u) = > + p xm (Ff — FELT FR 


Then it is clear from the monotonicity of ¢ that, at least for |u| < 4, 


plu) < A/N u). 
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Hence by Lemma 3.4.1 it is sufficient to prove that 
(3.4.12) Í $G/Ne-*^) dx < œ. 
(log 2)1/2 
It is straightforward to check that (3.4.12) is equivalent to 
(3.4.13) Y m !?$(27") « œ. 
m=1 


Hence this is all we need verify. Using (|x| + |y|)'/? < |x|V? + |y|!2 we see 
from (3.4.11) that (3.4.13) will be true if 


œ m 1/2 
(3.4.14) Y [m^!27" Y 29(Ft — Fh) <œ 
m=1 j^0 
and 
(3.4.15) Y. (m^! F£)'? < oo. 
m=1 


The latter condition holds by assumption (i.e. 3.4.10) so we need only check that 
(3.4.15) implies (3.4.14). However, using the same inequality as before we see 
that the left-hand side of (3.4.14) is bounded above by 


¥ 2°" WEF — FH)? x Y rm - FH)? 
PAL, do j=0 m=j 


2 > (FF ES Ff). 
j=0 


Applying Lemma 3.4.2 (with a; = Ff — Fj, > 0) to the last sum and noting 
(3.4.15) immediately establishes that the last sum is finite, completing the proof. 


To conclude our study of continuity properties for Gaussian fields we obtain 
a simple condition on the spectral distribution sufficient that ensures continuity. 
The following result was first stated for random fields by Delporte (1966) who, 
although he does not give a proof, states that it can be derived from a result 
equivalent to our Theorem 3.4.1. The result in the one-dimensional case, which 
involves quite complex arguments, is due originally to Hunt (1951). The proof 
given below, which relies on the previous theorem, is actually rather simple. 


Theorem 3.4.3 


Let X(t), te A“, be a real-valued, zero-mean, homogeneous Gaussian process 
with spectral distribution function F(X). Then X has almost surely continuous 
sample functions over I, if, for some £ > 0, 


(34.16) f loga + tipi aria < «e 
AN 
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Proof 


In view of the preceding theorem we need only show that (3.4.16) implies 
the finiteness of 3. ; (m^ ! FX)! ?. Note first that 


2m m log 2 
Í A7 (log 4) !? di = Í x V dx 
2m-1 ( 


m — 1)log 2 
> log 2(m log 2)7 !? 
= m "(log 2). 
Using this it is immediately obvious that 
2m 


Y (mF)? < og 2-17 Y gg [A 00g 2)? dA 
m=2 m=2 gm 


oo 2m 
Y [1 — F*(4)]247 ! (log 4) !? da 
m=2 


2m -1 


IA 


Í “TL — F*GJ]1217 log 2)! då. 
2 


= [a — F*(4)] 247!" (log Arm x [47 "(log Ay !] dà 
2 


where 5j = (1 — £)/2 and e, of course, appears in (3.4.16). Applying the Cauchy- 
Schwartz inequality to the last integral we obtain that its square is bounded by 


Í [1 — F*(A)]A~ ! (log 2)! ^? dà x Í A7 (log 4*7? dA. 
2 2 


Since y < 4 the second integral is clearly finite. Using integration by parts, we 
see that the finiteness of the first integral is equivalent to the finiteness of 


JA |log 4127?" dF*(4). 
This is bounded above by 
[ toea + A)]?> 2" dF*(A) = [ toga + Ay]! + dF*(A). 
We leave it as a (straightforward) exercise for the reader to show that the 


convergence of the final integral is in fact equivalent to (3.4.16) and thus com- 
plete the proof of the theorem. 


CHAPTER 4 


Geometry and Excursion Characteristics 


The preceding two chapters were concerned with setting up an appropriate 
probabilistic framework prerequisite for the study of random fields, a study 
to which we could now turn. However, our main interests lie with certain 
geometrical problems generated by these fields, and in order to study these it is 
first necessary to study some geometry. Consequently, the present chapter will 
be concerned only peripherally with random fields, in that it will concentrate 
most heavily on certain geometrical aspects of non-random functions. However, 
throughout the chapter, we shall keep in mind the fact that ultimately our 
interest is in random functions, and this consideration will have an important 
role to play in motivating which of their geometrical properties we shall study 
in the most detail. 

Recalling from Chapter 1 that the excursion sets 


A, = (te I: X(t) > u} 


seemed to be closely related to many interesting geometrical problems, and 
that in one dimension the geometry of these sets was particularly simple, the 
chapter commences by looking briefly at this rather special situation. Here, we 
already know, the geometry of the excursion sets is intimately related to the 
level crossings of X. Thus, after briefly reviewing level crossing theory, the 
remainder of the chapter uses this theory to motivate a search, through the 
concepts of integral geometry and differential topology, for a functional that 
generalizes to the excursion sets of functions on AZ, N > 2, the notion of the 
number of level crossings of a function on #'. Two such functionals will be 
found. When they are evaluated on a particular excursion set they will yield 
what we shall call excursion characteristics for that set. In the following chapter 
we Shall take a closer look at these excursion characteristics in the random 
setting and derive some of their distributional properties, thus commencing 
our main study—that of the geometry of random fields. 
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As we have just noted, the excursion sets ofa random field take on a particularly 
simple form when the dimension of the underlying parameter space is one, In 
this case they are composed of à number of disjoint closed intervals and so have 
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a rather simple geometrical structure. Because of this simplicity and the close 
relationship they have with level crossings, a great deal is known about them, 
and the associated literature is vast. In this section we shall give a brief qualita- 
tive review of this literature, with two main aims in mind. Firstly, because of the 
geometric simplicity of the one-dimensional case, it is natural to treat it inde- 
pendently of the more general situation in which sophisticated geometrical 
concepts are of central importance. Secondly, since the one-dimensional theory 
is well established, a review of what is known for this case provides an indication 
of the type of results we could hope to derive in the general case. 

Thus, since our review is intended to be primarily motivational, it is by no 
means intended that it also be exhaustive. The most detailed treatment of level 
crossings is still given in the now classic monograph of Cramér and Leadbetter 
(1967), some of which has been brought more up to date in Leadbetter (1972b). 
An excellent review of level crossings from an applied viewpoint is given by 
Blake and Lindsey (1973), while Marcus (1977) covers the most recent theoretical 
advances. 

To simplify the notation a little let us consider the number of upcrossings ofa 
stochastic process on the unit interval [0, 1] and denote this by N,, where u 
is the level involved. From Definition 1.7.1 of level crossings it can be easily 
shown that N, is a well-defined random variable if the underlying stochastic 
process possesses sample paths that are almost surely continuous. However, 
even in this case, there is nothing to guarantee that N,, although well defined, 
is necessarily finite. Indeed, this is not the case unless certain restrictions are 
imposed. Consequently, the study of level crossings breaks up naturally into 
two disjoint subject areas. One covers the ‘finite’ situation when either N, < oo 
almost surely (or E{N,} < oo), while the other covers the complementary 
‘infinite’ situation when N, = oo with probability one (or E{N,} = oo). Since 
we shall not encounter the random field analogue of the infinite case until 
Chapter 8 we shall restrict the current review to the finite case in which N,, « oo 
as. 

The serious mathematical study of level crossing phenomena began with the 
work of Kac (1943) and Rice (1945). Both of these authors were primarily 
concerned with obtaining the mean value of N, when the underlying process 
X(t) was zero mean, stationary, and Gaussian. Rice derived the following 
theorem, under a variety of extra conditions. The function R of the theorem is 
the covariance function of the underlying process. 


Theorem 4.1.1 


If N, is the number of upcrossings of the level u by a. zero-mean, stationary, 
almost surely continuous Gaussian process on (0, 1] then 


TE CN 
(4.1.1) EIN) m exp| |}. 


Where a? E!|[NGY 1. R(0) and A, R'(0). 
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Equation (4.1.1) holds regardless of the finiteness or otherwise of A,. Hence, 
by Theorem 2.2.2, a necessary and sufficient condition for such a process to have 
a finite-mean number of upcrossings is simply that it possess a mean square 
derivative with finite variance. Following Rice's original derivation of (4.1.1) 
the conditions under which the result holds were successively weakened by a 
number of authors, including Ivanov (1960), Bulinskaya (1961), Ito (1964), 
and Ylvisaker (1965), the final two authors giving minimal conditions. 

This formula for E{N,} is of fundamental importance in applications of 
Gaussian processes, for a variety of reasons. For example, it has implications 
for reliability computations related to various physical systems. Suppose we have 
atime-dependent system (such asa radio receiver) which operates in the presence 
of an interfering random process (static) and will operate successfully provided 
that the interference does not become too strong. Then the object of primary 
consideration here is how often the random process becomes too strong, and 
we have a problem related to level crossings in which the above theorem 
clearly provides useful information in terms of assessing the reliability of the 
system. 

Furthermore, suppose we are interested in the probability that a stationary 
process does not exceed some level u, at all, throughout the time period [0, T]. 
Then if N,(T) denotes the number of upcrossings in this time period we have 


P(X(t) > u, some t e [0, T]) = P(X(0) > uor N,(T) = 1} 
€ P{X(0) > u} + E{N,(T)} 
= P(X(0) > u} + TE{N,}. 


If the level u is high and T is large, then for a wide class of processes the second 
term in the above bound can be shown to be the dominating one, so that the 
relevance of (4.1.1) is clear. 

A second rather useful application of (4.1.1) lies in parameter estimation. It 
is well known (Cramér, 1965; Cramér and Leadbetter, 1967) that, under certain 
conditions (which ensure that process values at distant time points are essen- 
tially independent) on the covariance function of a stationary, zero-mean 
Gaussian process with unit variance, the following asymptotic result holds (see 
Theorem 6.9.4 for a formal statement of this result in the N-dimensional, N > 1, 
setting): 


(4.1.2) lim P 


T> œ 


KOBO « } o ast-em- 
| max A(T) sz exp[—exp(-z)], 


where 
A(T) = (2 log T) "?, 


log(AV?/2n) 


BCT) = log Ty"? 4 y 
(D= Cle D top py 
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An extreme value result of this nature is of obvious interest, and an inspection 
of the formulae of (4.1.2) yields that the only parameter related to the distribu- 
tion of X that appears here is the square root of 45. Hence it is of interest to be 
able to estimate this parameter from realizations of the process. Equation 
(4.1.1) allows us to do this, since it immediately suggests that for all real u the 
statistics 


2 

(4.1.3) A(u) = 2n6 exo( 352] N, 

26 
yield unbiased estimators of A1. Clearly it would also be of interest to combine 
a variety of estimators 6(u) based on more than one level, and this problem has 
been considered by Hasofer and Sharpe (1969), Lindgren (1974), and Bjórnham 
and Lindgren (1976). We shall consider the random field version of (4.1.3) in 
Chapter 5. 

From this brief discussion it should be clear that (4.1.1) holds a central 
position in the study of single-parameter, Gaussian stochastic processes. One 
of the major tasks of this and the following chapters will be to develop a useful 
analogue of this result for random fields. 

Given the importance of Theorem 4.1.1 it is natural to ask if more general 
information about the variate N, other than its mean can be obtained. The 
answer to this question is somewhat disappointing. While there does exist an 
integral expression for its variance, the integral involved is long and complex, 
and can be evaluated only numerically. Expressions for higher moments are 
generally too involved even for this form of evaluation. Hence, a fortiori, the 
exact distribution of N, is not available and we must have recourse to asymp- 
totics. 

For example, it is known that, again under certain conditions on the co- 
variance of a stationary Gaussian process that make its values in distant 
intervals asymptotically independent, the point process of such upcrossings 
tends, in a special sense, to a Poisson process as u and T tend to infinity at a 
prescribed rate. The most recent results in this direction are summarized in 
Leadbetter et al. (19792). Another important asymptotic result for Gaussian 
processes due originally to Malevich (1969) and sharpened by Cuzick (1976), is 
the following central limit theorem, which holds under certain conditions on the 
spectral density of X, and a growth condition on the variance of N (T), which we 
write as o?(T). This result essentially states that 


[No(T) — E{N(T)}/o(T) 5 N(,1) as T> œ. 


Perhaps the only other area directly related to excursion sets for single- 
parameter processes is the distribution of the intervals between successive 
upcrossings or, equivalently, between the initial points of excursion sets. Since 
this variate seems to have no relevance to the multiparameter situation, we shall 
not discuss it here. 
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The last main area of interest in one dimension has been related not so much 
to level crossings as to the number of local maxima of a process above a given 
level, and the form of these maxima when they are above very high levels. The 
structure of these maxima for Gaussian processes is in some senses almost 
deterministic, as they follow a rather definite form (see, for example, Lindgren, 
1970). Again, we shall study the multiparameter version of these results later on 
in Chapter 6. 

We can summarize at this point by noting that there are three main areas of 
interest in the study of level crossings: the mean value of N,, the asymptotic 
behaviour of N,(T) as either T or u and T together tend to infinity, and the 
related subject of local maxima above high levels. We shall look at all three of 
these problems for Gaussian, and related, random fields in Chapters 5 to 7, once 
we have determined how to appropriately characterize excursion sets in 
dimensions greater than one. 
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From the discussion of the preceding section, it would seem to be extremely 
worthwhile to be able to find, for the excursion sets of random fields on Æ”, 
N » 1, a random variable that characterizes them as fully as the number of 
upcrossings does for their one-dimensional counter-parts and, furthermore, 
is as readily amenable to probabilistic investigation. This secondary require- 
ment is of major importance, for the most obvious candidate in the first category, 
the number of components of the excursion set, or the closely related variate, 
the number of disjoint contour lines, cannot be directly studied in a probabilistic 
setting and so, at least in the first instance, must be excluded from consideration. 
The reason for this is reasonably simple. It is possible to compute from a knowl- 
edge of fi-di distributions the probability that a set t,,..., t, of points all lie in 
the excursion set A,, since this is simply P(X(tj) > u, i = 1,...,n). However, 
it is not possible from the fi-di distributions to directly compute the probability 
that they all lie, for example, in a single connected component of A,, and it is 
probabilities of this form that we need to know if we wish to study the distri- 
butional properties of the number of components of excursion sets. 

Given the problem we wish to solve and the difficulties inherent in the 
obvious approach, two paths are open to us. The first is to look for any character- 
istic of excursion sets that is amenable to probabilistic investigation, study it, 
and then pray that the results that can be derived are of significant value in de- 
scribing these sets. The second path lies in attempting a full topological in- 
vestigation of the excursion sets of non-random functions and then praying 
that the facts we discover are of use for their random counterparts. We shall 
adopt the second approach, in the belief that it is imperative, when dealing with 
the type of problems we are considering, to make use of the existing branches of 
mathematics that deal with N-dimensional geometry, rather than to use a 
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simple-minded approach based on our experiences in one dimension. For 
reasons that will later become obvious, the most natural branch of mathe- 
matics to turn to is that of integral geometry, a small part of which we shall now 
investigate in some detail. 

Quite comprehensive studies of integral geometry are available in the mono- 
graphs of Hadwiger (1957) and Santaló (1953, 1976), although virtually all of 
the results we shall derive can also be found in the paper by Hadwiger (1959). 
It seems not irrelevant to note that our interest will be in the classic integral 
geometry covered in these references, rather than the more recent integral 
geometry discussed, for example, by Matheron (1975). 

Essentially, we shall be interested in the study of a class of geometric objects 
known as basic complexes. Later on, we shall show that, with probability one, the 
excursion sets of a wide variety of random fields belong to this class, and the 
concepts that we are about to discuss are relevant to random fields. We com- 
mence with some definitions and simple results, all of which are due to Hadwiger 
(1959). 

We start by assuming that we have equipped 4" with a Cartesian coordinate 
system, so that the N vectors à; (with 1 in the jth position and zeros elsewhere) 
serve as an orthogonal basis for it. Then we can call a set of the form 


E = {te R:t; =a; jeJ, -œ «tj «oo,jéJ) 


ak plane of 2“ if J is a subset of N — k of the integers 1,..., N and a,,..., ay. y 
are fixed. That is, a k plane is a k-dimensional subspace of 4" generated by k 
vectors parallel to k of 8,,..., dy. 

We shall call a compact set B in 4" a basic if the intersections & ^ B are 
simply connected for every k plane & of A“, k = 1,..., N. This includes the case 
& = d". These sets, as their name implies, will form the basic building blocks 
from which we shall construct more complicated and interesting structures. 
It is obvious that the empty set $ is a basic, as is any convex set. Indeed, a convex 
set remains a basic under rotation, a property which actually characterizes this 
class of sets. Note that if B is a basic, then so is & A B for any k plane 4. 

Define now a set A c 4 to be a basic complex if it can be represented as the 
union of a finite number of basics such that the intersection of any of these 
basics is again a basic. Thus if 


A=B,U-:-UB 


where the B; are all basics we have that 


m 


By, Ote A Ba 


is again a basic for any combination of indices v,,..., v, k = 1,..., m. The set 
of basics 


p = PA) = By. Bm} 


is called a partition of A, and their number, an, is called the order of the partition. 
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The class of all basic complexes, which we shall denote by @&, possesses a 
variety of useful properties. For example, if Ae «2 then E Ae'€42 for 
every k plane &. In fact, if we denote the set of N-dimensional basic complexes by 
EAB“, then if £ isa k plane with k < Nand A e €", we have E ^ A e €4*. To 
prove this it suffices to note that if 


p= (B,,..., Bm} 
is a partition of 4 then 
píénB,...,é6nB, 


is a partition of & ^ A, and, since each & ^ B, is a k-dimensional basic, & ^ 
Ac GB". 

Another useful property of €4 is that it is invariant under those rotations of 
A~ that map the vectors 6,,...,6, onto one another. Furthermore, 44 is 
additive, in the sense that 4, Be €24 and AN B=¢ imply AU Be«€4. 
These and similar properties make the class of basic complexes quite large, and 
although 4" is certainly not as large as Z” it includes an immense variety 
of sets. 

As we have recently noted, we shall establish later that most random ex- 
cursion sets are basic complexes. As we are interested in generalizing the notion 
of the number of upcrossings of a single-parameter function it is clear that it 
would be useful to define an integer-valued functional o say, on €82 which in 
some sense performs this generalization. Noting that a single basic is certainly 
an analogue of a single component of the excursion set of a one-dimensional 
function, it follows, in view of our intended use of o, that one of its elementary 
properties must be, for all basics B, 


0 if B = $, 
1 otherwise. 


(4.2.1) ọ(B) = f 


Another property that we might reasonably require of ọ is additivity, in the 
sense that when each of A, B, A U B, A ^ B is a basic complex 


(4.2.2) (A v B) = q(A) + GB) — GA n B). 


An important result of integral geometry states that not only does a functional 
possessing these two properties exist but it is uniquely determined by them. 
We shall now proceed to obtain an explicit formulation for it. 

To do so, let p = p(A) be a partition of order m of some A e € into basics. 
Then we define the characteristic of the partition to be 


(4.2.3) XA. P) = EPB) - ZB, 0 B) + ++ 
FC IEUSHB, eso BS) £o 9€ DIR, oo By) 
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where X? denotes the summation over all combinations (v,,...,v,) of the 


elements (1,...,n,1 < n < m,andzis an indicator function for basics, defined 
by 


£($)—-0, &(B)=1, BH . 


Then if a functional ¢ satisfying (4.2.1) and (4.2.2) does in fact exist, it follows 
iteratively from these conditions and the definition of basic complexes that 


(4.2.4) (4) = A, p) 


for any A e €B and any partition p. In fact, we shall prove the following result. 


Theorem 4.2.1 


Let A c A” be abasic complex and p = p(A) a partition of A. T hen the quantity 
y(A, p) is independent of p. If we denote this by @(A), then ọ satisfies (4.2.1) and 
(4.2.2), and q(A) will be called the Hadwiger characteristic of A. 


Proof 


We proceed by induction. When N = 1, basics are simply closed intervals and 
points, and the empty set. Then if we write N(A) to denote the number of disjoint 
intervals and isolated points in a set A c 421, setting 


(4.2.5) (4) = N(A) 


yields a function satisfying @(A) = y(A, p) for every p and for which (4.2.1) and 
(4.2.2) are clearly satisfied. 

Now let N > 1 and assume that for all spaces of dimension k less than N 
we have obtained a functional g* on @B* for which g*(A) = x(A, p) for all 
A € €Z" and every partition p of A. Choose one of the vectors, 6;, and for 
x €(— oo, oo)let &, denote the (N — 1) plane of points in 2%, all of which have 
their jth coordinate equal to x. Let A e €" and let p = p(A) = (B,,..., Bat 
be one of its partitions. Then clearly the projections onto & of the cross- 
sections A © &, are all in €271, so that there exists a partition-independent 
functional p, defined on {A ^ &,, A e €" determined by 


(4.2.6) PLAN 6,) = o* (projection of A ^ &, onto £o). 


From q, we can define a new partition-independent function f by 
(4.2.7) f(A, x) = fA n 6). 
However, by the induction hypothesis and (4.2.7), we have from (4.2.3) that 


(4.3.8) f(A.x) LOB, 06) | EB, A BL Ab) +. 


74 GEOMETRY AND EXCURSION CHARACTERISTICS 4 
Consider for a moment just one of the right-hand terms, writing 
e(x) = (B, OO Bun 6). 


Since (x) is zero when the intersection is empty, and one otherwise, we have 
for some finite a and b that e(x) = lif a € x x b and e(x) = 0 otherwise. Thus 
£(x) is a step function, taking at most two values. Hence f(A, x), being the sum of 
a finite number of such functions, is again a step function, with a finite number 
of discontinuities. Thus the following right-hand limits always exist: 


(4.2.9) f(A, x) = lim f(A, x — y). 
yo 


Now define a function h, which is non-zero at only a finite number of points 
x, by 


(4.2.10) h(A, x) = f(A, x) — f(A, x7) 
and define 
(4.2.11) @(A) = X h(A, x) 


where the summation is over the finite number of x for which the summand 
is non-zero. Note that since fis independent of p so are ^ and q. 

Thus we have defined a functional on @&, and we need only check that (4.2.1) 
and (4.2.2) are satisfied to complete this section of the proof. Firstly, note that if 
B is a basic and B # 6, and if a and b are the extreme points of the linear set 
5; ^ B,thenf(B, x) = lifa < x < band equals zero otherwise. Thus h(B, a) = 1, 
while h(B, x) = 0, x # a, so that q(B) = 1. This is (4.2.1), since (o) = 0 is 
obvious. Now let A, B, AU B, A n B all belong to €2. Then the projections 
onto &, of the intersections 


Ané,, Boé,, AoBné,, ANBA, 
all belong to €28* ^ ! and so by (4.2.7) and the induction hypothesis 
f(A o B, x) = f(A, x) + f(B, x) - f(An B, x). 


Replacing x by x^ and performing a subtraction akin to that in (4.2.10) we 
obtain 


h(A U B, x) = h(A, x) + h(B, x) — h(A ^A B, x). 
Summing over x gives 
9A o B) = ofA) + GB) — GAB) 


and (4.2.2) is established. Thus we have that g(A) = x(A, p) and so (A, p) is 
independent of p; thus the proof of the theorem is complete. 


Note that since (A, p) is independent of j the choice of the vector 8; in the 
proof does not affect the final value of ¢. 
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Let us now pause and recall what we have actually established. Essentially, 
under the assumption that excursion sets can be shown to be basic complexes, 
and assuming that we want to define some functional on € which generalizes 
the notion of the number of level crossings to excursion sets, then if we want the 
basic properties (4.2.1) and (4.2.2) to be satisfied the Hadwiger functional is the 
only functional available. However, since this functional, as defined through 
X(A, p) in (4.2.3), depends very heavily on the topological structure of each 
set, it would seem to be just as difficult to obtain probabilistic information 
about @(A) when A is actually a random excursion set as it is to obtain similar 
information about the number of components of A. Fortunately, however, as 
we developed the proof of Theorem 4.2.1 we obtained an alternative way of 
computing (A) for any A e €B given explicitly in the following theorem, 
in which &, is as defined above and recalled below. 


Theorem 4.2.2 


The Hadwiger functional q, as defined by (4.2.4), has the following equivalent 
iterative definition for basic complexes A c P: 


(A) = number of disjoint closed intervals in A, N =1, 
pA) = X(o(A ^ €,) — «(4 ^ 6,-)), N>1, 
where 


QA ^ 6,-) = lim g(A A é,-,) 


ylo 


and the summation is over all real x for which the summand is non-zero. 


This theorem is a simple consequence of (4.2.4), (4.2.10), and (4.2.11), and 
requires no further proof. The previous theorem implies that the choice of vector 
9; in Z” from which we obtain 6, is irrelevant. (Recall that if x represents dis- 
tance along 8,, then &, is the (N — 1) plane orthogonal to j, intersecting it at 
distance x along it.) 

The importance of Theorem 4.2.2 lies in the iterative formulation it gives us 
for o, for, using this, we shall show in the following section how to obtain yet 
another formulation that makes the Hadwiger functional of an excursion set 
amenable to probabilistic investigation. 

Figure 4.2.1 shows an example of this iterative procedure in 2°. Note in 
particular the set with the hole ‘in the middle’. It is on sets like this, and their 
counterparts in higher dimensions, that the characteristic o and the number of 
connected components of the set differ. In this example they are, respectively, 
zero and one. 

Let us pause once more, this time to consider some of the properties of the 
functional e. Firstly, it does not automatically yield a generalization of the 
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Figure 4.2.1 Computing o in #? 


number of level crossings of a one-dimensional function F(t), for in this case 
qQ(A,CF, S)) is the number of components of A,, which does not in general 
equal the number of crossings of u in S. Note also that because of the more 
complicated nature of the concept of connectivity in 2, N > 2, the natural 
generalization to 2” of the number of connected components of a set in #! 
satisfying an additivity criterion is via the functional q rather than through the 
number of components itself. The relationship between the two when N — 2 
is actually rather simple: @(A) is the number of connected components of A 
minus the number of ‘holes’ in A. In higher dimensions, the simplest way to see 
what happens is to consider sets related to the N-dimensional unit ball, Sy, 
and the N-dimensional sphere, 0S,, for which we have 


e(Sy) Ll  e0S,)-21-(-DV 
If Ky , represents Sy with k non-intersecting cylindrical holes drilled through it, 
then, since both Ky , and its boundary belong to «P, 
q(Ky 4) = 1 + (—1)%k, 
while 
PKs) = [1 + (- D" *]( — k). 


Finally, if we write Ky , to denote Sy with | ‘handles’ attached, then 


Ply.) =1-1. 


Now let us see how this functional can be used to derive the concept we are 
seeking. 

To commence, we consider a function F(t) where t is restricted to lying in I,, 
the unit cube in #%, Furthermore, Ict us assume that the excursion set of F 
above the level n in /, is a basic complex. Sufficient conditions for this will be 
given in the following section. Then from what we have noted earlicr, when 
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N = 1 the number of upcrossings in [0, 1] is @(A) if F(0) < u or (4) — 1 if 
F(0) = u. In fact, by Theorem 4.2.2 the number of upcrossings equals 


€(A ^ [0 1]) — «(A ^ [0], 


where [0] denotes the set whose only element is the origin. 

This way of counting upcrossings generalizes naturally to higher dimensions, 
and if 7, represents all those faces of J, which contain the origin, a natural 
generalization of the number of upcrossings to random fields is the quantity 
T(A), defined as follows. 


Definition 4.2.1 


Suppose the excursion set A = AF, I,) of the function F(t): RY + 2?! above 
the level u is such that A is a normal body. Then we define the IG (integral geo- 
metric) characteristic of the excursion set to be the integer T (A), defined by 


(4.2.12) T(A) = q(A) — «(A n 1) 


Using Theorem 4.2.2 we can obtain a useful recurrence relation for P(A) when 
the dimension N is greater than one: 


(4.2.13) I(A) = A ^ I) — «(A ^ 1) 
= E(o(A Nn 6,) — «(A n €,-)} 
-E(o(Anl,n6, — (ANI, 9 6,-)} 
= X(I(A ^ 6,) — I(A ^ 6,-)) 


where now 
6, = {tel ty = x}, 
I(A ^ 6,-) = lim (A n @,), 


ytx 
and the summation in (4.2.13) is over all x e (0, 1] for which the summand is 
non-zero. Note that we can dispense with the point x = 0 in this sum since 


I(A ^ &) =T(An &-) 


from the definition of I. In the one-dimensional setting T(A,(F, [0, 1])) is simply 
the number of upcrossings of u by F in [0, 1]. 

Let us consider now some of the properties of the IG characteristic of an 
excursion set. Firstly, as we have just noted, it reduces to the number of up- 
crossings in one dimension so that it is a true generalization of this concept. 
Secondly, when the excursion set does not intersect 7, it equals the Hadwiger 
characteristic of the excursion, which is the natural generalization of the number 
of excursions of a one-dimensional process (which in this case cquals the 
number of uperossings). |l lowever, even in the case when the excursion set does 
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Figure 4.2.2 The IG characteristic in 2°: 
T(4) - 3 


meet 7, the IG characteristic has quite a valid meaning which can be phrased 
in terms of upcrossings and downcrossings. For example, in Figure 4.2.2, which 
gives an example in 2°, only those (shaded) parts of the excursion set not 
intersecting 7, contribute to the IG characteristic of the excursion, so that the 
total IG characteristic is three. The x points that contribute to this are at the 
same height as those marked with circles, and 7, is that part of the boundary 
depicted by the dotted line. From this example it is clear that when the excursion 
set does intersect 7,, we are considering, in a heuristic sense only, of course, 
some parts of the set as the analogue of downcrossings rather than upcrossings. 

The extension of the IG characteristic to the excursion sets of functions F 
defined on more general sets than I, is quite straightforward. Let S be a set in 
AN of the form 


(4.2.14) S=C,UC,uU---uC,, 


where each of the N-dimensional intervals C,,...,C,, intersect on, at most, 
(N — 1)-dimensional faces. Let c; be the corner point of C; closest to the 
(— o0,..., — oo) and let C; denote all those faces of C; which contain c;. Then if 
A = A,(F, S) is a normal body, we simply define the IG characteristic of A by 


(4.2.15) I(4) = p te(A ^ €) — e(A n €»). 


From this definition it is straightforward to see that the following recurrence 
relation holds: 


(4.2.16) I4) Y TASE) VAS, )] 
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as in (4.2.13). Furthermore, all the properties of I we have already discussed 
for the case S =/, also hold in this more general situation. Moreover, T 
is, like @, additive in the sense of (4.2.2). Thus, when we deal with T in the future 
we shall restrict ourselves to the notationally simpler case S = 1,, and leave the 
straightforward extension to sets of the form of (4.2.14) to the reader. 

Finally, we note that there is one further reason why T is a useful concept on 
which to base further study. We have already noted in Chapter 1 that when 
dealing with random fields, topologically meaningful functionals defined on 
excursion sets are generally not amenable to probabilistic investigation. 
However, although F is a topologically derived functional, under certain 
regularity conditions on the function generating the excursion sets it has a 
representation in terms of a finite point set in J,, and in this form it can be 
studied in the stochastic situation. Because of this it is possible to derive, for 
example, exact expressions for its moments and to explicitly evaluate its mean in 
the case when the underlying field is Gaussian. Once this can be done the fact 
that I'(A,) has topological meaning can be used to deduce a variety of other 
results about excursion sets. 
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We now have a twofold task confronting us in order to justify these claims 
about the value of the characteristic I in describing excursion sets A,(F, S). 
Firstly, we must show that excursion sets are in fact basic complexes, so that their 
IG characteristics are well defined and possess the properties we have already 
discussed. Secondly, we need to obtain a simple method of computing IF based 
only on those properties of F which, in the random setting, are completely 
determined by the fi-di distributions. However, before we can attack either of 
these problems we require three preliminary lemmas. The first is well known as 
the inverse mapping theorem of analysis, and we shall not give a proof. 


Lemma 4.3.1 


Let U c 4" be open and g = (g',...,g%): U — R” be a function possessing 
continuous, first-order partial derivatives 0g;/0t;, i, j = 1,..., N. Then if the 
matrix (0gj/0t;) has a non-zero determinant at some point t € U, there exist open 
neighbourhoods U , and V, of t and g(t), respectively, and a functiong" ! : V, > U, 
for which 


£ (gt) t forallte U,, 
gg 's)-s  forallse V. 


The second is the following, known as the implicit function theorem. 
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Lemma 4.3.2 


Let U c & be open and F: U — 9?! possess continuous, first-order partial 
derivatives. Suppose at t* € U, F(t*) = u and F y(t*) 4 0. Then the equation 


F(t,,..., ty. gto stu) =U 


defines an implicit function g which possesses continuous, first-order partial 
derivatives in some interval containing (tf, . . . , t& 1), and such that g(t, ..., t.) 
= tk. The partial derivatives of g are given by 


F. 
--— forj 2 L...,N-L. 
N 


Furthermore, if F is k times differentiable, so is g. 


Throughout the remainder of this section S will denote a compact subset of 
9?" for which A«(0S) = 0. 


Lemma 4.3.3 


Let F: R” — 9?! be suitably regular with respect to S at the level u. Then for all 
permutations (vi,. .., vy) of (1, ..., N) there are only finitely many t € S for which 


(4.3.1) F(t) — u = F, (=-= F (0-0 


Proof 


To simplify the notation we consider only the trivial permutation v; = j, 
j= L..., N. It will be clear from the proof and Definition 3.1.1 of suitable 
regularity that no loss of generality is involved in this assumption. To start, 
let g: S > 4" be the function defined by 


(432) g(Q0-F(ü-w  g(0-F, (0, forj=2,...,N. 


where g? represents the jth coordinate function of g and F, represents the partial 
derivative OF /0t;. 

Let t e S satisfy (4.3.1). Then by condition (3.1.3) of the definition of suitable 
regularity, t ¢ 0S. Thus if we write U to denote the interior of S, we have t e U. 
We now show that there is an open neighbourhood of t throughout which no 
other point satisfies (4.3.1). This implies that the points in S satisfying (4.3.1) are 
isolated, and thus, from the compactness of S, we have that there are only a 
finite number of them. This, of course, proves the lemma. 
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By Lemma 4.3.1, however, it is clear that such a neighbourhood will exist 
if the N x N matrix (0g;/0tj) has a non-zero determinant at t. However, this 
matrix has the following elements: 


og! 


> F (t), for j = 1,..., N, 


Agi 
CZF ih “Moris Oe NSN, 


Since t satisfies (4.3.1) all elements in the first row of this matrix, other than the 
Nth, are zero. Thus, expanding the determinant along this row gives us that it is 
equal to 


(4.3.3) (— I^! Fy(tdet D(t) 


where D(t) is the symmetric (N — 1) x (N — 1) matrix with elements F; (t), 
i, j= L,..., N — 1. Since (4.3.1) is satisfied conditions (3.1.2) and (3.1.4) of 
suitable regularity imply, respectively, that neither F(t) nor det D(t) is zero 
which, in view of (4.3.3), is all that is required. 

We are now in a position to prove the following theorem. 


Theorem 4.3.1 


Let F: RN — 9! be suitably regular with respect to S at the level u. Then the 
excursion set A,(F, S) is a basic complex. 


Proof 


When N = | the result is trivial. Consider N = 2. Since A, is clearly compact 
we need only show that the excursion set has a partition into a finite number of 
basics. 

Consider the set of points t € S satisfying either 


(4.3.4) F(t) —u= F,(t) =0 
or 
(4.3.5) F(t) — u = F,(t) = 0. 


For each such point draw a line containing the point and parallel to 6, or 6,, 
depending, respectively, on whether (4.3.4) or (4.3.5) is satisfied. These lines form 
a grid over S, and it is not difficult to see that the connected regions of A, 
within each cell of this grid are in fact basics. Furthermore, these sets have 
intersections which are either straight lines, points, or empty, and Lemma 4.3.3 
guarantees that there are only a finite number of them, so that they form a parti- 
tion for 4,. An example of this partitioning procedure is shown in Figure 4.3.1, 
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Figure 4.3.1  Partitioningexcursionsets. (Partitioning points are marked 
with 0 or ©) 


Thus we have shown that A, has a finite partition into basics so that A, is a 
basic complex. 

For N > 2 essentially the same argument works, using partitioning (N — 1) 
planes passing through points at which 


FQ -u-F,(0---:-—F,, O 


for any permutation (v,,..., vy) of (L,..., N). Lemma 4.3.3 again guarantees 
the finiteness of the number of basics forming the partition. We shall leave it 
to the reader to convince himself of the veracity of this approach. 

We now face the problem of obtaining a simple way of computing the IG 
characteristic of an excursion set. Let us start by making the simplifying 
assumptions that the dimension of the parameter space is two and S is simply 
the unit square I,. Let F: 4? — 4?! be suitably regular with respect to I, at the 
level u. Consider the summation (4.2.13) defining '(A,(F, 1,)); viz. 


(4.3.6) TA, = (4,06, -I(A,0 6,-)). 


xe(0, 1] 


where &, is simply the straight line t, = x. 

It is straightforward to see that A ^ &, is composed of a sequence of ny, 
say, disjoint closed intervals not containing the point (0, x), and one more 
such interval containing this point if F(0, x) > u. Thus (A ^ 6,) = n, and so 
values of x contributing to the sum clearly correspond to values of x where n, 
changes. It is immediately clear from continuity considerations that contribu- 
tions to I'(A) can only occur when &,, is tangential to the boundary of A (type I 
contributions) or when F(0, x) =u or F(1, x) =u (type II contributions). 
Consider the former case first. 

If £, is tangential to 0A at a point t, then F,(t) = 0. Furthermore, since 
F(t) = u we must have that F(t) 4 0, as a consequence of suitable regularity. 
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Thus, in the neighbourhood of such a point and on the curve 0A, t; can be 
expressed as an implicit function of t, by 


F(t,, g(t,)) = u. 
The implicit function theorem (Lemma 4.3.2) gives us 


d, — FAO) 


di | — F(t)’ 


so that applying what we have just noted about the tangency of &, to 0A we 
have for each contribution of type I to (4.3.6) that there must be a point te I, 
satisfying 


(4.3.7) F(t) 2 u, 
and 
(4.3.8) F,(t) = 0. 


Furthermore, since the limit in (4.3.6) (see 4.2.13) is one sided, continuity 
considerations imply that contributing points must also satisfy 


(4.3.9) F(t) > 0. 


Conversely, for each point staisfying (4.3.7) to (4.3.9) there is a unit contribu- 
tion of type I to (A4). Note that there is no contribution of type I to I' (A) from 
points on the boundary of I, because the regularity condition (3.1.3) is in force. 
Thus we have set up a one-one correspondence between unit contributions of 
type I to (A) and points in the interior of I, satisfying (4.3.7) to (4.3.9). It is 
easily seen that contributions of + 1 will correspond to points for which F , ,(t) «0 
and contributions of —1 to points for which F,,(t) > 0. Again, because of 
suitable regularity (condition 3.1.4), there are no contributing points for which 
F,,(t) = 0. 

Consider now type II contributions to (A). Using similar arguments it can 
be seen that we obtain a contribution of + 1 for every point t = (1, x), x € (0, 1], 
where 


F(t) 2 u, F(t) > 0, F(t) > 0, 
and a contribution of — 1 for each point t = (0, x), x e (0, 1], satisfying the same 
conditions. Thus if we define 
(4.3.10) T (Tj) = number of points I, satisfying (4.3.7) to (4.3.9) and 
F,,(t) < 0 (F,,(t) > 0), 
and 
(4.3.11) L2(D, ) = number of values of x, x e (0, 1], for which if t = 
(1. xt = (0, x), F(t) = u, F (t) > 0, F(t) > 0, 


we have the following result. 
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Theorem 4.3.2 


Let F: R? — 9?! be suitably regular with respect to I, at the level u. Then the 
IG characteristic of its excursion set A (F, I) is given by 


(4.3.12) I(A4) = ("7 — Ty) + (07-013) 


This is the point set characterization of the IG characteristic that we have been 
seeking. In the following chapter we shall show how to obtain some of its 
distributional properties when the underlying function is random. 

However, our quest has not yet ended, for the above characterization is 
valid only in the two-dimensional setting. In principle, we can extend the argu- 
ment that led to this result to higher dimensions. For example, arguing heuristic- 
ally for the moment, when N = 3 €, becomes the hyperplane 


E, = (te R:t; = x} 
and the points within the interior of I, which contribute to a characterization 
like that in (4.3.12) can easily be seen to be those points satisfying 
(4.3.13) F(t) — u = F,(t) = F,() = 0 and F,(t) > 0. 


The contribution of such a point, t* say, to the defining sum will depend on how 
the implicit function g(t,, t2) defined by 


(4.3.14) F(t,, t5, gis t2)) = u 


behaves at the point (tf, t1). If g has a local maximum or minimum at this 
point, the corresponding contribution is positive, while it will be negative if g 
has a saddle point there. (Note that 4.3.13 implies g must have a critical point of 
some kind at (tT, t#).) 

The behaviour of g is of course determined by the matrix G(t) with elements 


Gift) = git). 


However, (4.3.14) and (4.3.15), along with the standard rules for differentiating 
implicit functions and the facts we noted while providing Lemma 4.3.3, give us 
that 


Fft) 
F(t)’ 


Defining, as usual, the 2 x 2 matrix D(t) elementwise by 
Dit) = Fit) 


we have from elementary results of two-parameter calculus that a maximum 
or minimum of g at t*, and hence a positive contribution to [(A,), occurs if 
(4.3.13) holds and det D(t*) > 0, while a saddle point and negative contribution 
occur if dct D(t*) e 0. 


Gift) = 


for i,j = 1, 2. 


4.4 DIFFERENTIAL TOPOLOGY AND THE CHARACTERISTIC ¥ 85 


This gives a reasonable characterization of the points within the interior of I, 
that contribute to the IG characteristic of A,. A similar argument will lead to a 
characterization of the contributing points on 01,, but the argument is awkward 
to write down rigorously and the final characterization is messy. It is extremely 
difficult, however, to carry this style of argument over to dimensions higher than 
three in a rigorous fashion, for, in such cases, geometric intuition is much more 
difficult to rigorize. To work in higher dimensions it becomes necessary to 
introduce concepts such as homology groups and homotopies from the realm 
of algebraic topology, and these subjects are clearly outside our current range 
of interests. 

Thus, at least as far as the IG characteristic is concerned, we must leave the 
study of excursion sets where it now stands, with a proper point set representa- 
tion for I'(A,) in the two-dimensional case, and indications of a similar repre- 
sentation for N = 3. In most practical cases, this information is sufficient. 
Nevertheless, it is always pleasant to have a full theory, so in the following 
section we shall have a look at the situation for general dimensions. The reader 
interested only in the two-dimensional case can pass over that section without 
any loss. 
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Since the study of excursion sets in dimensions greater than two is qualita- 
tively different to that of the preceding section, we shall take a somewhat 
different approach in presenting it. It is possible in two dimensions to do every- 
thing in a complete, rigorous, fashion, while at the same time being reasonably 
brief. This is not possible in higher dimensions, and thus the approach we shall 
take will be to argue heuristically, initially, and ultimately justify the path we 
take by demonstrating that the end product of our endeavours is a useful one. 
The true justification of these heuristics, however, lies not so much in the final 
usefulness, but rather that everything we shall say does in fact have a rigorous 
formulation in terms of the concepts of differential topology. Good introduc- 
tions to these concepts can be found in the elementary book of Wallace (1968) 
and the more advanced text of Eilenberg and Steenrod (1952). The concepts and 
results which we shall rely on most heavily are to be found in the extensive 
monograph of Morse and Cairns (1969). 

The line of approach we shall take will be as follows. Firstly, we consider only 
suitably regular functions whose excursion sets over the unit cube I, do not 
intersect &I,, and use results of differential topology to obtain a point set 
representation for their Hadwiger characteristics. Then we relax the condition 
relating to 0/,, and use the point set representation obtained in the first stage 
to motivate the definition of a new excursion characteristic that, later on, we 
shall sec is also of wide applicability. 


86 GEOMETRY AND EXCURSION CHARACTERISTICS 4 


Thus, let us commence with a function F: 4" > #', suitably regular with 
respect to I, at the level u, and let A = A,(F, I) be its excursion set. Furthermore, 
let us suppose for the moment that 


Anal, = ¢. 


From the analysis of the preceding section we know that A is a basic complex, 
so that its Hadwiger characteristic (A) is well defined. However, A is a rather 
special basic complex, since its boundary, 04A, is especially smooth. In fact, since 


0A = (te I: F(t) = u} 


the suitable regularity of F ensures that ðA is a (N — 1)-dimensional manifold 
of class C?. Essentially, what this means is that to each small enough open 
subset of 0A there corresponds an open set E c #7! and an invertible func- 
tion g: E ^ & such that this subset is simply the range of g over E, and the 
coordinate functions of g are twice differentiable with respect to their (N — 1) 
parameters. 

The general theory of compact sets in 4" bounded by C? manifolds is the 
domain of differential topology, and in that subject there is a well-defined 
functional, which we shall denote by ọ, from the class of all such sets to the 
integers. The functional, when evaluated on a particular set, is known as the 
Euler, or Euler- Poincaré, characteristic of the set. The Euler characteristic of a 
set is defined via an alternating sum, similar in form and substance to the 
summation of (4.2.3) that defined the Hadwiger characteristic. The two func- 
tionals are not equivalent, for they are defined over distinctly different classes of 
sets. Nevertheless, if a given set lies within the domain of definition of both 
functionals, then it is relatively straightforward to see that its two character- 
istics are identical. 

Excursion sets of the type we are currently considering (i.e. F suitably regular 
and A ^ ôl, = $) fall into both domains of definition, so that although we 
originally motivated the introduction of the functional g on integral geometric 
grounds, wecan now turn to the vast theory of differential topology to investigate 
its properties. In this setting, however, there is nothing particularly simple about 
excursion sets over the unit square, and we can look at excursion sets over 
general compact S c 4" without difficulty. We shall do this. 

We need to start with some terminology and notation. Let f(t), te 2”, 
be a real-valued function of class C? on an open subset of 2” (ie. f(t) has 
continuous partial derivatives of up to second order on this set). A critical point 
t* (i.e. a point where f(t*) = 0,j = 1,..., N) will be called ND (non-degenerate) 
if the Jacobian 


D( fis fo... fn) 


(4.4.1) = det[ f;(t*)] 4 0. 
N 
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If each critical value of fis ND, f itself will be termed ND. We define the index 
of a critical point t* of fto be the number of negative eigenvalues of the matrix 
(fi(t*)), ij = 1,2,..., N, counted with their multiplicities. 

We shall call a compact subset Z of 2" a regular C? domain in A if it is 
bounded by a regular (N — 1)-dimensional manifold X of class C?, where 
x = 0Z is simply the boundary of Z. (See Morse and Cairns, 1969, Chaps. 
1-10, for more formal definitions.) There is no need to assume that either € 
orZ is connected, but we do insist that they have a finite number of components. 
By X. we shall denote the submanifold of points t e X for which the directional 
derivative of f in the direction of the outward normal to X at t is negative. 
Finally, we shall call f admissible relative to a regular C?-domain Z if f is of 
class C? on an open neighbourhood of Z, if f has no critical points on X, and if 
the restrictions of f to Z and X, f |Z and f |È, are both ND. 

Let us denote by m,, k = 0, 1,..., Nm, k = 0, 1,..., N — 1), the kth type 
number of f | Z( f |Z_); i.e. the number ofcritical points off | Z( f |£ _) of index k. 
Arguing as in the proof of Lemma 4.3.3, it is easy to show that the m, and m, are 
all finite if fis admissible relative to Z. The following important result is due to 
Morse and Cairns (1969, Theorem 10.2’) and is of major importance both to 
differential topology and our current interests. 


Theorem 4.4.1 | (Morse's theorem) 


Let f (t), te 4" be any real-valued function of class C^, admissible relative to a 
regular C? domain Z in 2“. T hen 


N N-1 
(4.4.2) e(Z) = Y (C U'm + } C- Dm, 
k=0 k=0 
where @(Z) is the Euler characteristic of Z. 


The proof of this result is both long and involved, and fills most of the mono- 
graph from which we have quoted it. Since the proof rests heavily on notions 
such as homology groups, it is not possible to give even an outline of it without 
firstly developing a substantial background. 

The main value of the above theorem, from our point of view, is that it leads 
to a point set representation for the Euler characteristic of excursion sets, 
since under the conditions we are currently imposing on F, the set A is itself 
a regular C? domain. To apply Theorem 4.4.1 we need only define over A an 
appropriate function f, for which we choose simply the Nth coordinate func- 
tion, i.e. 


(44.3) A(t) = ft... ty) = ty. 


We must establish that f is admissible relative to A, as in the following 
Lemma. 
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Lemma 4.4.1 


Let F: 2% > R! be suitably regular with respect to a compact S c 2" at 
the level u, with the excursion set A = A,(F, S). Then if A ^ 08 = $ the Nth 
coordinate function is admissible relative to A. 


Proof 


The function f, as defined by (4.4.3), is clearly of class C? and has no critical 
points anywhere in Z since f,(t) = 1 for all t. Thus f will be admissible if 
f |OA is ND, ie. the non-degeneracy condition (4.4.1) is satisfied by the restricted 
function at its critical points. 

To show this, we first note that since fy(t) is never zero, the implicit function 
theorem (Lemma 4.3.2) tells us that any t € ðA has a neighbourhood on 0A 
in which the Nth coordinate of each point can be expressed as a function of the 
first (N — 1) coordinates. This function, g say, is determined by 


(4.4.4) F(t,,... tyi g(t +--> ta) =U 


and it is obvious that, locally, f |OA is identical to such a function. Since F 
possesses continuous second-order derivatives, g is of class C?. Furthermore, 
since, for j = 1,..., N — 1, 


0g | Fj 
Ot; Fy 
it follows that t e GA will bea critical point of g (equivalently, f | A) if and only if 
Ft) 
= 0, forj = L...,N- 1. 
Fy(t) i 


Since F is suitably regular, (3.1.2) implies that if F(t) — u = F,(t) =- = 
Fy- ,(t), then F,(t) # 0, so that a t e 0A will be a critical point of g if and only if 


(4.4.5) F(0—-0,  forj=1,...,N— 1. 


To complete the proof we need only show that the (N — 1) x (N — 1) 
matrix with elements 0g/0t;0t;, i, j = 1,..., N — 1, has a non-zero determinant 
at all points on 0A satisfying (4.4.5). But in view of (4.4.4) and the implicit 
function theorem the elements of this matrix are equal to 


—FyFy, forij=1,...,N—1, 
so that its determinant is given by 
(- D" "[FA()]^ ' det[D(0] 


in the usual notation. But condition (3.1.4) of suitable regularity states that this 
is non-zero, so the proof is complete. 


4.4 DIFFERENTIAL TOPOLOGY AND THE CHARACTERISTIC X 89 


We are now in a position to apply Morse's theorem with the Nth coordinate 
function f. Firstly, however, we note that since f itself has no critical points, the 
type numbers m, appearing in (4.4.2) are all zero, while, by Lemma 4.3.3 and the 
above, the type numbers m, of f |A are finite. To determine the actual values of 
the m; we look at the function F itself. 

It is straightforward to see that at points on 04 at which F,=0, 
j=1,2,...,N — 1, the outward normal to @A is parallel to the N vector 
(0, 0,..., — Fy), so that since fy = 1 throughout A" if follows from the definition 
of type numbers that those critical points of f |OA contributing to the m; will 
be those at which Fy is positive. Noting the previously mentioned fact about 
the matrix of second-order derivatives of f|@A at these points (viz. the de- 
terminant of this matrix equals (— 1) ^ ! FA^ ! det D), we have that each type 
number m, can be specified as follows. With some abuse of terminology, we 
now say a matrix has index k if it has k negative eigenvalues. 


Lemma 4.4.2 


The kth type number m, of f |OA when F(t): A > 9?! satisfies the conditions 
given above is equal to the number of points t € S for which the following conditions 
hold: 


(a  F(0-w 
(b) F(t—-0 j=1,2,...,N-1, 
(c) | F«0»0 
(d) N odd: the index of D(t) equals k, 
N even: the index of D(t) equals N — k — 1. 


Furthermore, the type numbers m, of f | A are all zero. 


The following result is of major importance in terms of providing a basis for 
the definition of a new characteristic of excursion sets which we shall introduce 
soon and in providing the basis for probabilistic results to come in the following 
chapters. The truth of Theorem 4.42 is an immediate consequence of Theorem 
4.4.1 and Lemmas 4.4.1 and 4.4.2. 


Theorem 4.4.2 


Let F: A“ — 9?! be suitably regular with respect to some compact set S c &™, 
and assume that F(t) < ufor t e 0S. Then the Euler characteristic of the excursion 
set AF, S) is given by 


N | 
PAY - CD ! Y Dink, 


Án 


90 GEOMETRY AND EXCURSION CHARACTERISTICS 4 
where m is the number of points t € S satisfying the following conditions: 

(a) «= Ft) =u, 

(b) Fí(t 2-0  j2L2..,N—1 

() . Fy(t) > 0, 

(d) the index of D(t) equals k. 


Note that because of the alternating sum (4.4.2) in Morse’s theorem the 
distinction between odd and even values of N that appears in condition (d) 
of Lemma 4.4.2 can be put in a simpler form for the corresponding condition of 
the above theorem. 

Thus Theorem 4.4.2 provides us with a way of expressing the Euler character- 
istic of an excursion set in terms of the local behaviour of the function F, 
provided F(t) « u when te0S. This condition is not in general satisfied, 
however, and there is no analogue ofthis result for the general case. Nevertheless, 
the result of Theorem 4.4.2 motivates the following definition for a new character- 
istic y(A) of excursion sets. 


Definition 4.4.1 


Let F: A“ = 9! be suitably regular with respect to some compact subset 
S c J^. Then we define the DT (differential topology) characteristic of its 
excursion set AF, S) to be the number y(A), defined by 


N-1 

(4.4.6) XA) = (CD Y, (~ 1x), 
j=0 

where the y ( A) are identical to the m; of Theorem 4.4.2. 


Note that this definition of y(A,) is dependent on a particular coordinate 
system, and changing this system (e.g. by rotation) can change the value of 
Y(A,). If, however, A, ^ 0S = ¢, so that y(A,) is merely the Euler characteristic 
of A,, it can be seen as a consequence of Theorem 4.4.1 that the definition (4.4.6) 
is, in fact, independent of the coordinate system. 


45 SOME PROPERTIES OF y AND T 


Having defined two distinct excursion characteristics we shall now take a 
moment to compare them and consider some of their simpler properties. 

Both the IG characteristic l and the DT characteristic y are based on the 
Euler Poincaré characteristic of the excursion set A (F, /,). The relationship 
between them is simple. In one dimension, both are equal to the number of 
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upcrossings of u by F in [0, 1]. If A, ^ ĉl, = $, they are identical and equal to 
the Euler characteristic of A,. If A, ^ 01, # d, then their respective point set 
representations differ according to the number, and type, of points on ôl, 
satisfying 


F(t) —u-— 0. 


In the latter case, it is the IG characteristic that has the marginally clearer 
topological meaning. 

In practice, however, these differences are of minimal importance, and the 
most important difference lies in the fact that there exists a point set representa- 
tion for y in all dimensions, while I has such a representation only in two di- 
mensions. Thus, in future, we shall tend to use y in preference to T, except in 
exclusively two-dimensional problems, where we shall still prefer T. 

The main value of both of these characteristics, however, lies in the random 
setting, when the underlying function is a Gaussian random field, for then it is 
possible to obtain an explicit expression for E{y(A,(X, 1,))) and, when N = 2, 
E(T(A,(X, I,))}. This has not proved possible for any other functionals defined 
on excursion sets. The evaluation of these expectations forms the content of the 
following chapter. Before we turn to this task, however, we shall prove one 
rather interesting and extremely useful result about the DT characteristic. 

Let V be an invertible mapping from I, c 4 onto some set J’ in 2", with 
coordinate maps y^, k = 1,..., N, such that the Nth coordinate mapping is the 
Nth coordinate function, i.e., 


(4.5.1) VI (ts. esty) = ty- 
Suppose, furthermore, that each y* possesses finite first- and second-order 
partial derivatives wi, Vf, and that the (N — 1) x (N — 1) matrix 


(4.5.2) A(t) = (Jt), uci 


has a finite, non-zero determinant for every t € ],. Then we have the following 
lemma. 


Lemma 4.5.1 


Let Y: I, — T be as above and let F: I, + 9?! be suitably with respect to I, at u. 
If G: I' ^ 9! is defined by 


(4.5.3) Gt) = F(t), 


then the DT characteristic of A (G, T’), as defined in Definition 4.4.1, is identical 
to that of ALF, L,). Furthermore, 


(4.5.4) x A(G, 0) = YALE 0) 
foreveryj = OV... 6! N |. 
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Proof 


Choose a point t e I, with the image point s = W(t) e J’. Then according to 
the chain rule for multiparameter functions (e.g. Apostol, 1957, p. 113) we have 


(4.5.5) FAQ = È VOGO) 
= X VOG) 
and 
(4.5.6) FAQ = E OGO + E Y VOGO. 


Let D,(t) and D,(s) denote, respectively, the (N — 1) x (N — 1) matrices 
with elements F;(t) and G,{s), i, j = 1,...,N — 1. Now suppose that se I’, 
with the preimage t € 1,, is one of the points contributing to y,( A,(G, I^). 

That is, s satisfies 


(4.5.7) G(s) — u = G,(s) = --- = Gy (8) = 0, 
(4.5.8) Gy(s) > 0, 
(4.5.9) Index Dg(s) = k. 


Note that (4.5.1) implies that, for every t, vt) = 1 and W(t) = O ifj z N. 
Then (4.5.3) and (4.5.5) imply that if (4.5.7) and (4.5.8) are satisfied 


F(t) -u = Fi =--- = Fy-,(0 -0 


and 
Fy(t) = Gy(W(t)) > 0. 


However, (4.5.6) and the above conditions easily imply (since y(t) = 0 for all 
i,j = 1,..., N) that 

D;(t) = A(t)DG(s)A'(0, 
from which it immediately follows from the fact that A is an invertible symmetric 
matrix (see, for example, Perlis, 1958, p. 93) that D,(t) and D,(s) have the same 
index. 

Thus we have now shown that y (A,(G, I)) 2 x((AKF, /,)) for each j, and the 
reverse inequality is easily established simply by writing F(t) = G(V ^ !(t)) and 
repeating the above argument. This of course, establishes (4.5.4) and thus the 
lemma. 


This lemma will be extremely useful later on, for its content is that appro- 
priate transformations of the parameter space of a function do not change the 
DT characteristic of its excursion sets. Howcver, if the underlying function is a 
random field such a transformation can markedly change its covariance 
structure and thus greatly simplify probabilistic computations. 


CHAPTER 5 


Some Expectations 


5.1 A GENERAL RESULT 


The main aim of this chapter is to obtain expressions for the mean values of 
the IG and DT characteristics of excursion sets of certain random fields. There 
is a variety of approaches that can be taken to solve this problem and a variety 
of technical methods that will yield a solution. We shall adopt an approach 
which may at first seem to be unnecessarily general, but will ultimately yield 
solutions to problems which will not be posed until later chapters. 

To develop this approach, consider for a moment the IG characteristic of a 
two-dimensional random field X(t), i.e. I'(A,(X, I,)) for some u. Suppose we 
wish to compute the mean value of this characteristic. From the discussion of 
Section 4.3 it follows that computing this expectation is equivalent to computing 
the expectations of random variables such as the number of points t€ I, 
satisfying 


(5.1.1) X(t) —u— X,(t -0 
and 
(5.1.2) X (t) > 0, Xi «0 (or X,,(t) > 0). 


In higher dimensions, the definition of the DT characteristic of excursion 
sets involved similar expressions, except that for those cases (5.1.1) was replaced 
by 

X(t) — u = X,(t) = Xt) =--- = Xy_,) = 0, 


while (5.1.2) was replaced by X (t) > 0 and a condition on the second-order 
partial derivatives related to the index of the matrix 


Dit) = (X; (0), ij = 1; = 1. 


A way to write this in a general form would be to proceed as follows. Firstly, 
define two vector fields, X(t), Y(t), by setting 


X(t) = [ X(t) — u, X,(t)..... Xy_ (0), 
Y(t) = [ X (t), Xi (0,0... My I.N ((01. 
91 
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Then X has N elements, while Y has K = [N(N — 1)/2 + 1] elements. To 
proceed to the second stage, let Ag, k = 0,..., N — 1, be the set in ZF defined by 


Y(t) e A, X t) > 0 and the index of D(t) is k. 


Then to compute the DT characteristic of an excursion set all we need to know 
is the number of points t € J, satisfying 


X(t)=0 and Y(t)eA, 


for each k between 0 and N — 1. 

When we write our problem in this form, it becomes reasonably easy to see 
what a general formulation should look like; viz. given two vector-valued 
random fields, X: 4" — 4?" and Y: A“ — B®, aset A c AF, find an expression 
for the expected number of points t € J, for which 


(5.1.3) X()-0 and Y(theA. 


Although obtaining a full solution to a problem of this generality is by no 
means impossible, it is undesirable from at least two points of view. Firstly, in 
order to cover all possibilities, it becomes necessary to place a multitude of 
awkward and often undesirable conditions on X, Y and A that obscure the 
basic result. Secondly, the proof becomes so involved that the details obscure 
the basic technique involved. For these reasons we shall proceed by first 
establishing a weak result that only gives us the mean number of points satisfy- 
ing (5.1.3) under rather strong conditions. This is the content of Theorem 5.1.1 
below and its ‘corollary’. Although the result is weak, it suffices for most 
practical situations. More important, though, are the facts that its proof is 
relatively easy to follow and that it easily provides an expression for the ex- 
pectation we are seeking that acts as a guide when we try to obtain the same 
result under weaker conditions using less transparent derivations. 

To state the result we need some notation. As above, X = (X1,..., X") 
and Y = (Y!,..., YE), respectively, will denote ~- and #*-valued N-di- 
mensional random fields. Furthermore, A will denote some subset of 25, and 
0A its boundary, while S will be a compact subset of 4" with boundary 08. 
We shall write VX(t) to denotethe N x N matrix of first-order partial derivatives 
of X at t; ie. 

Ox'(t) 
VX(t) = ( à 


J 


) for i,j 2 L...,N. 


Furthermore, we write I, for the indicator function of A; i.e. 


1 if y € A, 
Ly) = 
4) n if y é A, 


while, for any > 0, o(c) is the ball of radius c defined by 


a(t) = {x e AN: xl « e} 


5.1 A GENERAL RESULT 95 


and 6,(x) is a function on & defined to be constant on o(c) and zero elsewhere, 
normalized so that 


6,(x) dx = 1. 


e(t) 


We can now prove the following theorem. 


Theorem 5.1.1 


Let X, Y, A and S be as above, and suppose the following conditions are satisfied 
with probability one: 


(5.1.4) The components of X, Y and VX are all continuous. 


(5.1.5) There are no points t € S satisfying both X(t) — 0 and either 
Y(t) € 0A or det VX(t) = 0. 


(5.1.6) There are no points t € OS satisfying X(t) = 0. 


(5.1.7) There are only a finite number of points t€ S. satisfying 
X(t) = 0. 


Then if N(S) denotes the number of points t € S for which X(t) = 0 and Y(t)e A, 
we have, with probability one, that 


(5.1.8) N(S) 2 lim faao 4A(Y(t))|det VX(t)|dt. 
£20 «S 


Proof 


We start by choosing a realization of X which satisfies conditions (5.1.4) 
to (5.1.7) above. Consider those te $ for which X(t) = 0. Since there are only 
finitely many such points and none lie on 0S, each one can be surrounded by an 
open sphere, of radius 4, say, in such a way that the spheres neither overlap 
nor intersect 0S. Furthermore, because of (5.1.5) we can ensure y is small enough 
so that within each sphere Y(t) always lies in either A or its complement, but 
never both. 

Let o(e) be the sphere |X|| < € in the image space of X. From what we have 
just established we claim that we can choose £ small enough for the inverse 
image of o(c) in S to be contained within the union of the ņ spheres. (In fact, if 
this were not so, we could choose a sequence of points t, in S not belonging to 
any 4 sphere, and a sequence e, tending to zero such that X(t,) would belong to 
o(é,) for each n. Since S is compact the sequence t, would have a limit point t* 
in S, for which we would have X(t*) = 0. Since t* ¢ OS by (5.1.6) we must have 
t* € S. Thus t* is contained in the inverse image of o(c) for any £, as must be 
infinitely many of the t,. This contradiction establishes our claim.) 
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Furthermore, by (5.1.5) and the inverse mapping theorem (Lemma 4.3.1) 
we can choose £, 7 so small that, for each y sphere in S, o(e) is contained in the X 
image of the y sphere, so that the restriction of X to sucha sphere will be one-one. 
Since the Jacobian of the mapping of each y sphere by X is |det VX(t) | it follows 
that we can choose £ so small that 


N(S) = [scxeonr aCY(t))|det VX(t)|dt. 


This follows since each y sphere in S over which Y(t) e A will contribute exactly 
one unit to the integral, while all points outside the y spheres will not be mapped 
onto a(e). Since the left-hand side of this expression is independent of £ we can 
take the limit as £ — 0 to obtain (5.1.8) and thus the theorem. 


The theorem, as it currently stands, does not yet tell us anything about 
expectations. Ideally, it would be nice simply to take expectations on both sides 
of (5.1.8) and then, hopefully, find an easy way to evaluate the right-hand side 
of the resulting equation. This, however, cannot be done without further 
assumptions, and it is precisely at this point that the generality of the current 
problem leads to the awkward and often unnatural conditions that we mentioned 
earlier. Nevertheless, let us proceed for the moment, and see what happens if 
we do simply take expectations as suggested. We then have 


E{N(S)} = lim E faxo aC Y(t))|det VX(t)|dt 
£20 S 


Š Í Í Í Iy) det vac im 8,606 Q06.(Vx, yx) ax} 
SJ RNIN +1)/2J RK £204 @N 
x dVx dy dt, 


where the ¢, are the obvious densities. Taking the limit in the innermost integral 
yields 


(5.1.9) E{N(S)} = (ff a(y)|det Vx | $,(0)0, (Vx, y|0) dVx dy dt 


= i E(|det VX(t)|14(¥(0)|X(t) = 0),(0) dt. 


Of course, interchanging the order of the integration and limiting procedures 
requires justification, and we shall obtain results like (5.1.9) in a rigorous 
fashion in the following section. Nevertheless, at this point we can state the 
following ‘corollary’ to Theorem 5.1.1, in which the h, are as above. 
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Corollary 

If the conditions of the theorem hold, as well as ‘adequate’ regularity conditions, 
then 


(5110) E{N(S)} = iEn NN Vx|- L(y) ¢.(0, Vx, y) dVx dy dt 


= Í E(|det VX(0] - L(Y()|X(0 = 0}4,(0) dt. 


As will be seen in the following sections, ‘adequate’ regularity conditions 
generally require no more than that the density $, above be well behaved (Le. 
continuous, bounded) and that enough continuous derivatives of the X and Y 
fields exist with enough finite moments. 

At this point we suggest that the non-mathematical reader turn directly to 
Theorem 5.22 in which ‘adequate’ conditions are specified, and the above 
result is used to obtain an expression for the mean value of the DT character- 
istic of excursion sets. He will thus spare himself the rather involved mathe- 
matics required to justify the limiting procedures we have so freely undertaken 
above. Indeed, there is probably much to be said in favour of even the mathe- 
matical reader skipping this detail on his first reading. 


5.2 AN EXPRESSION FOR THE MEAN VALUE OF y 


We now face the task of rigorously evaluating E{y(A,(X, S)} for random 
fields X. We shall assume, throughout this section, that X is an N-dimensional 
random field suitably regular with respect to the compact set S at the level u, 
and that 0S has zero Lebesgue measure. Furthermore, X will be assumed to be 
homogeneous. Then 7(A,(X, S)) was defined in the preceding chapter by 


N-1 
XA) = C- 0"! $ (7 De, 
k=0 
where y, is simply the number of points t € S satisfying 
(5.2.1) X(t —u- X,(t —:-— Xy 1(0-0 
(5.2.2) X(t) > 0, index of D(t) equals k. 


The matrix D(t) is, we recall, the (N — 1) x (N — 1) matrix with elements 
X, i,j = L...,N— I. 

Theorem 5.1.1 can now be used in this setting by identifying the vector X 
of the theorem with the vector (X — u, X,,..., Xy 4), while Y becomes the 
[N(N — Dy2 + l] vector with elements Xy and X;j, i,j = L.... N — 1. Since 
we shall need to use this result N times, there will be N regions, 49,..., Ay. 4 
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The kth region, A,, will be defined as the region where (5.2.2) is satisfied. Finally, 
to simplify notation, let us write M(t) to denote the matrix VX(t) given below: 
X,(t) X(t) 250 X1n-1(t) 

X Xt) X(t) (x X, x- Y (t 


VX(t) = M(t) 


Xt) Xx Xy ya 
x: 
XA): Dit) 


XO Xyi(0 --- Xy, ya) 


It then follows from the suitable regularity of the random field and Lemma 
4.3.3 that all the conditions of Theorem 5.1.1 are satisfied, so that 


(5.2.3) X(A) = no XA) 
where 
(5.2.4) XA) = f 5XL (Y(0)|det MO) ldt. 


In principle we can now proceed to use (5.2.3) to evaluate E{y,(A)} by, for 
example, placing high enough moment conditions on xg. This approach was 
originally adopted by Adler and Hasofer (1976), and the resulting computation 
is both algebraically awkward while at the same time demanding unnecessarily 
strong conditions on the field X for the computation to work. Better results can 
be obtained by using (5.2.3) only to obtain an upper bound on E(X (A) while 
obtaining an (identical) lower bound via a completely different route. We 
shall proceed in this fashion by firstly proving the following lemma, in which we 
write $(x, x,,..., Xy, Z) for the joint density of X, X,,..., Xy, Z, where Z is the 
vector of length N(N — 1)/2 obtained from D(t) by placing the successive 
columns on and above the main diagonal of D(t) under one another. 


Lemma 5.2.1 


Let X be as described above. Then if the second-order partial derivatives of X 
have finite variances, $ is continuous in each of its variables, and the conditional 
density of (X, X,,..., Xy .,) given Xy and the second-order partial derivatives 
Xi, 1<i<N,1<j< N — is bounded above, the mean value of the kth 
component x, of the DT characteristic of the excursion set A (X, I.) is bounded 
above as follows: 


(5.45) E{y,(A,)} x ASX- 1)* [stet d)ó(u, 0,...,0, xy, Z) dxy dz, 


xn c0 
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where the second integral is over all z e 9 - 9? for which the matrix d has k 
negative eigenvalues. 


Proof 


We start, unfortunately, with some more notation. We write X'(t) for the 
vector of length N with elements X,(t),..., Xy(t) and X'(t) for the vector of 
length N(N'+ 1)/2 — 1 with elements X;(t) ji i—-12,...,.N, j=1, 
2,...,N — 1. Let W(x, x’, x") denote the joint density of LX, X', X"]. Then by 
(5.2.4) we have, for any fixed c > 0, 


(5.2.6) E{y(A)} = Í dt faldet m(t)| W(x, x’, x”) dx dx’ dx”, 
s 


where the second integral is over all x, x,,..., XN- Xnpj = 1,2,..., N — Las 
well as xy > Oand all values of x;;, 1 <i < j € N — 1, for which the submatrix 
d of m (defined analogously to D and M) has index k. The integral interchange is 
justified by Fubini's theorem. Because of homogeneity the inner integral in 
(5.2.6) is independent of t so that E(y;,(4)) simply equals this inner integral 


multiplied by the Lebesgue measure of S, A(S). Thus 
(5.2.7) 


E(4(A)) = HS) Í dx" dxy 


x IL — u, Xy... Xy ))ldet m] w(x, x’, x") dx dx,, ..., dxy. i. 
By expanding det m the inner integral is easily seen to converge to 
(5.2.8) (— D*xy det d y(u, 0,...,0, xy, x" 


as £ > 0, where the alternating sign comes from the expansion of det m and the 
fact that since d has k negative eigenvalues |det d| = ( — 1)* det d. Furthermore, 
as we shall now show, this inner integral is bounded by an integrable function, 
so that applying the dominated convergence theorem, Fatou's lemma, and 
integrating out xy;,j = 1, 2,..., N — 1, it follows from (5.2.3) that 


EG (4) € lim EC) 
—(—IFA) Í E det d y(u, 0,...,0, xy, Z) dxy dz, 


where the second integral is over those ze RW? for which d has k negative 
eigenvalues, This. of course, proves the lemma. 
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To obtain the bounding function we note that the inner integral of (5.2.7) 
is not greater than 


(5.29) [BÐ] Yxn, x^) ju f Eo L. a 


X—H-—E 
N 
N xil larly, x1... Xy c ilXy, X) dx dxy,...,dxy—4, 
k-1 


where wW, and y, are, respectively, the joint probability density of (Xy, X") and 
the conditional density of (X, X,,..., Xy ,| Xy, X"), By(e) is the volume of o(c) 
(so that By(e) = Ky" for a finite constant Ky), and a, is the cofactor of m,, in 
m. Each a, is a quadratic form in the x;; and, in particular, |ay| = |det d|. 

Since the conditions of the theorem imply the existence of a finite C for which 
V1 «C, by choosing e < 1 (so that |x,|<1,k=1,..., N — 1, in 5.2.9) we can 
ensure that (5.2.9) is bounded by 


N-1 
CK yaen x X [al t [xx]: Idet dl); 
k=1 


This expression is clearly integrable since each of the variables appearing in it is 
assumed to have finite variance. This completes the proof. 


We can now turn to the more difficult part of our problem: showing that the 
upper bound for E(y,(A,)) obtained in the preceding lemma also serves as a 
lower bound under reasonable conditions. We shall derive the following result. 


Lemma 5.2.2 


Suppose that the assumptions of Lemma 5.2.1 hold and, furthermore, that S is 
convex and the moduli of continuity c, w;; of the X;, X;; satisfy the following 
condition for any £ > 0: 


(5.2.10) P [maxi wh] > 1 —o(h") ashl0. 


i,j 


Then (5.2.5) holds with the inequality sign reversed. 


Since the proof of this lemma is rather involved, we shall start by first de- 
scribing the principles underlying it. Essentially, the proof is based on construct- 
ing a pathwise approximation to the vector-valued process 


X(t) = (X(O — u, X,(t..... Xy-,(t) 


and then studying the zeros of the approximating process. By obtaining a 
suitable approximation it is possible to conclude, under the restrictions on the 
moduli of continuity of X and its partial derivatives, that to certain zeros of the 
approximating process in $ there are corresponding zeros of X(t). In essence, the 
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Figure 5.2.1 The approximation in #'; u = 0 


approximation is based on obtaining a fine partition of S and then replacing 
X(t) within each cell of the partition by a hyperplane passing through the value 
of X at the midpoint of the cell, We then argue that if the approximating process 
has a zero within a certain subset of a given cell then X(t) has a zero somewhere 
in the full cell. 

In one dimension, for example, we replace the function X(t) — u on [0, 1] 
by a series of approximations Xt) given by 


Xo()- XG + D27) + E- 60-927]: X(G +427”) — u, 
j2"zt«(j-12" 


and study the zeros of Xt) as n> oo. Although this is not the common 
approximation used in one dimension it generalizes far more easily to higher 
dimensions than does the usual one. (See Ylvisaker, 1965, and Cramér and 
Leadbetter, 1967, for full accounts of the optimal procedure in this case.) An 
example of this approximation in 2?! is given in Figure 5.2.1, 

Before we can proceed with the details of the proof we need to state the 
following classical result of analysis, known as the Brouwer fixed point theorem. 
Proofs of this result are easy to find (e.g, Smart, 1974, p. 11). 


Lemma 5.2.3 


Let S be a compact, convex subset of AN and f : S — S a continuous mapping of S 
into itself. Then f has at least one fixed point; i.e. there exists at least one point 
te S for which f(t) = t. 
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We also need the following definition and lemma, a proof of which can be 

found, for example, in Apostol (1957). 

Definition 5.2.1 


If s and t are two distinct points in A~, then by the line segment joining s and t 
we mean the set 


L(s, t) = {u:u = 0s + (1 — 00 «0 < 1). 


Lemma 5.24 (Mean value theorem for 2") 


Let S be a bounded open set in A~ and let f: S > 9" have first-order partial 
derivatives at each point in S. Let s and t be two points in S such that the line 


segment L(s, t) c S. T hen there exist points u!,. .. , u" on L(s, t) such that 
f(t) — f(s) = Vf(u!,...,u") -(t— s) 
where Vf(u!, ... , u~) denotes the matrix function Vf with the elements in the kth 


column evaluated at the point u*. 


Finally, we must set down the following notation. For each n > 1 we let L, 
denote the lattice of points in 4" whose components are integer multiples of 
27"; Le. 

L, = {te 2" :t,— i. 2 "^j L...,N,i-0 £1, +2,...}. 
Two hypercubes centred on an arbitrary point t are defined by 
A,(t) = (se A`: |s; — t| 27 "* j= L..., N}, 

At) = {se A~: is t| x(1— 92 "* Sj = L..., NY 


Proof of Lemma 5.2.2 


The proof now commences by noting that we can restrict our attention to small 
subsets of S and consider each one singly. Choose n > 0 andi e L,. Now set 


bus 1 if x(A ^ A)) 20 
" ]lO  ifg(AnmA()-0 
and define approximations yj(A) to y,(4) by 


HWA) = X In. 
ie L4 
Then suitable regularity can be easily seen to imply Piy,(0A,(i)) = 0} = 1 
for every n, i, implying P{z,(\_) 0^,0)) = 0} = 1 so that 


XA) S YA) ds Ho s. 
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Furthermore, it is easy to see that the sequence y; is non-decreasing in n, so 
that the monotone convergence theorem yields 
(5.2.11) E(x(A)) = lim E{xK(A)} 


> lim 2P{y,(A n A) > 0} 
where the summation is over all i € L, for which A,(i) c S. 

We shall now investigate the limiting value of P{yj(A 7 A,(i)) > 0} by using 
the approximation to X mentioned earlier. To do so, let ô and ¢ be two arbitrary, 
small, positive real numbers, and K an arbitrary large positive number. For a 
given realization, X, of the random field define the function cot(n) by 


ox(n) = max| max o(2 ^", max o2 ") 
1<j<N 1si,j<N 


where the œ; and c; are, as usual, the moduli of continuity of the X; and X;; 
over S. Furthermore, define My by 


Mx = max| max sup|Xt)|, max pix . 
1<j<N teS 1<i,j<N teS 
Finally, define y to be the number 
E Se 
~ ANNUK + DN 2° 


n 


Then the conditions of the theorem imply that, as n > oo, 
(5.2.12) P(ot(n) > n} = o(2 


Choose now a fixed n and t € L, for which A,(t) c S. Assume that w¥(n) < 4 
and the event G5, occurs, where 
Gsx = {|det D(t)| > ô, X\(Q > 6, My < K, index D(t) = k}. 


Define t* to be the solution of the following equation, when a unique solution in 
fact exists: 


(5.2.13) X(t) = (t — t*)- M(t). 


Recall M(t) = VX(t). (See the beginning of the section.) 
We claim that if of(n) < 4 and G5, occurs, then t* € A(t) implies 
X(A 0 A(0) > 0 


if n is large enough. If this were truc, then by choosing n large enough for (5.2.12) 
to be satisfied it immediately follows that 


($2.14) | Piy,(A At) = OF > PIG O Lt Ap o nh. 
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To establish our claim we need first show that M(t) is invertible for these realiza- 
tions. It follows simply from (5.2.13) and the condition on Mx that t* € At) 
implies that each of the elements of X(t) has absolute value less than 
2^"KN(1 — «)/2. But the last N — 1 elements of X(t) are the first N — 1 elements 
in the first column of M(t). Thus expanding det M(t) down its first column and 
using the conditions of Gg we note easily that 

2 


(5.2.15) [det M(t)| > 5. 


as long as nis large enough (2^" < ó?/[K"(N — 1)N!(1 — e)]). We shall assume 
hereafter that n is in fact large enough for (5.2.15) to hold. Then in view of 
(5.2.13), t* € Af(t) is equivalent to 


(5.2.16) t — f(t) M (t) e At). 


Now let t be any other point in A,(t). It is easy to see that 


2 
[det D(t) — det D(t)| < 2 


under the conditions we require. Thus, since det D(t) > ô, it follows that 
det D(t) z 0 for any t € A,(t). From this it is immediate that index D(t) = k 
throughout A,(t). Similarly, we must have that X,(t) > 0 throughout A,(t), 
so that if there exist one or more points t € A,(t) at which X(t) = 0 we must 
have y,(A ^ A(t)) > 0. Thus we need only show that t* e Az(t) implies the 
existence of at least one t € A,(t) at which X(t) = 

The mean value theorem (Lemma 5.2.4) allows us to write 


(5.2.17) X(t) — X(t) = (t — t). M(t’,..., t”) 


for some points t!, .. ., t" lying on L(t, t), for any t and t, where M(t’, ..., t") is 
simply the matrix function M with the elements in the kth column evaluated 
at the point t*. From (5.2.15) and the conditions of Gx it follows, via similar 
arguments to the above, that if t € A,(t) then the matrix M(t’, ..., t") of (5.2.17) 
has a non-zero determinant and is thus invertible. Hence we can rewrite (5.2.17) 
as 


(5.2.18) X(t)- M^ «(t!,...,t) = XX0M et... t) te € 


Suppose we could show that X(€M ^ !(t!, ..., t") e A,(0) if t* € A&(t). Then 
by the Brouwer fixed point theorem (Lemma 5.2.3) it would follow that the con- 
tinuous mapping of A,(t) into A,(t) given by 


tt — X(0M (tl,..., t), te A(t), 


has at least one fixed point. Thus, by (5.2.18), there would be at least one 
t € A,(t) for which X(t) = 0. In other words, 


(5.2.19) — fe f(n) < n. Gay, t* € ALD, n large? = zt o A0) > 0. 
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But X(M  (t,..., t") € A,(0) is easily seen to be a consequence of t* e AX) 
and G, simply by writing 
X(0M ' (t!,...,t) = [X(0- M^ (0]- MO - M- Kt, ..., t]. 
= [X()M O] + (M(t) — M(t.,..., t))M (t,..., €)] 
and noting (5.2.16), then bounding the second term as above. 


Thus we have now succeeded in establishing the validity of (5.2.15). The 
remainder of the proof is quite straightforward. From (5.2.15) we have 


P{n(A A A(0) > 0} 


> 


Í |det m(t) | Y(t ga t*) : m(t), XN, xi) dt* dx; D o(2-"*), 
Gox n (t*eAR(t) 


wherel ix N,1xj € N — l, y, is the joint density of the vector-valued 
variable (X — u, X;, Xi 1i & N, 1 Ej € N — 1), and we have performed 
the transformation given by (5.2.13). If we now let n oo it follows from the 
mean value theorem for the integral calculus that the last expression, multiplied 
by the number of terms in the summation (5.2.11), converges to 


AG) — 2)" Í 


Go 


det d(t)| - x XOU, . . - , 0, xy, Xi) dxy Axi). 


Letting e - 0,ó > 0, K — oo and applying the monotone convergence theorem 
to the above expression we obtain, from (5.2.11), 


(5.2.20) E(y4A)) = (—1)*A(S) Í fxs det dó(u, 0,...,0, xy, Z}>dXy dz 
xn>O 


on returning to the standard notation. This, of course, completes the proof of 
Lemma 5.2.2. 


Combining Lemmas 5.2.1 and 5.2.2 immediately yields the following im- 
portant result. 


Theorem 5.2.1 
Under the conditions of Lemmas 5.2.1 and 5.2.2, 


(5.2.21) E{y,A,)} = ASX- 1)* fedet d)ó(u, 0,..., xy, Z) dxy dz, 


xn>O0 


where the second integral is over all z e BN - 1Y? for which d has k negative eigen- 
values. Furthermore, the mean value of the DT characteristic is given by 


(5.2.22) E{y(A,)} = ASX- DY! Í [sae d)d(u, 0, . .. , xy, Z) dxy dz, 
: XN ^70 


where the second integral is over all € 0? 
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The second equation follows directly from the first by using the representation 
of y as an alternating sum of the y, and noting that the N different regions of 
integration for the second integral.in (5.2.21) (corresponding to k = 0,..., N — 1) 
have as their union the whole of ANA- !2, 

An important special case of the above result arises when the random field 
is Gaussian. From the discussion at the beginning of Section 3.3 it is immediate 
that most of the conditions of Lemma 5.2.1 and 5.22 are automatically fulfilled 
under the assumptions of the following result. 


Theorem 5.2.2 


Let X(t), t € A“, be a zero-mean, homogeneous Gaussian field on S, a compact, 
convex subset of A“ whose boundary 08 has zero Lebesgue measure. We assume 
that X has almost surely continuous partial derivatives of up to second order with 
finite variances in an open neighbourhood of S and that the joint distribution of X 
and these partial derivatives is non-degenerate. Furthermore, let the moduli of 
continuity of the X;; satisfy the condition 


(5.2.23) pmax oh) > 1 = o(h") ash | 0. 


iJ 


Then (5.2.21) and (5.2.22) hold. 


Sufficient conditions, in terms of the covariance function of the random 
field, for (5.2.23) to hold are easily obtainable from Theorem 3.4.1. 

An important aspect of (5.2.23) is that, unlike the condition (5.2.10) on which 
it is based, it places no explicit conditions on the w;. The reason for this is simple. 
It is easy to see that, for small enough h,’ 


wh) < hM, 
where 
M = 1 + max sup X;f(), 
ij S 


‘J 


so that 
max wh) > =M > hte. 
But results we shall later encounter (especially Theorem 6.9.2) imply that, under 
the conditions of Theorem 5.2.2, 
h“'P{M > ht ->00  ash—0 
so that 


iP [mas wh) > j| 20  ash-0. 
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Thus the restrictions we need to place on the œw; are automatically satisfied, and 
(5.2.23) suffices for the conclusions of the theorem to be valid. 

As useful as the above result is, the integral formula for E{y(A,)} is far from 
trivial, and the manner in which it depends on the level u and on the covariance 
structure of the random field is anything but transparent. The task we must 
now face is that of obtaining a more useful form of the expectation formula. 

Before we do this, however, it is probably worthwhile to make one comment 
about the proof we adopted for the preceding results. Essentially we have used 
two different approximation methods to compute upper and lower bounds for 
E{y,}. We have already noted that the first approximation, using the ô, function, 
can also be used to obtain an upper bound for E{y,}, but only at the cost of 
unwanted assumptions. This is not the case for the second approximation, which 
relied on the mean value theorem to approximate the sample functions of X. 
This approximation can be used, without imposing further conditions, to obtain 
a lower as well as an upper bound on E{y,}. Indeed, this is probably the most 
efficient method of proof. Our decision to use the ó, approach was motivated 
primarily by didactic considerations. 

The history of Theorem 5.2.1 is of sufficient interest to be worthy of retelling. 
It was first given, for Gaussian fields in two and three dimensions only, by Adler 
and Hasofer (1976) and later for N dimensions by Adler (1976a). These two 
papers used only proofs based on the ó, function, and so contained a number of 
unnecessary assumptions. In Adler (1976b) essentially the proof given here was 
used, although the unnecessary, and unused, assumption that X was Gaussian 
was made. However, well before these works expectations of variables closely 
related to the DT characteristic were undertaken by other authors. Perhaps 
the earliest was by Longuet-Higgins (1957), who used a rather unrigorous 
technique dating back to Rice (1945) to compute the expectations of such 
variates as the mean number of local maxima of two- and three-dimensional 
Gaussian fields. Similar results for N-dimensional random fields were an- 
nounced in Belyaev (1967), although no details of the proofs were given. In 
Belyaev (19722) and Malevich (1973) a result more general than our Theorem 
5.2.1 is stated, essentially in the form of our ‘corollary to Theorem 5.1.1, but 
with specific regularity conditions given. These conditions are virtually identical 
to those of our Theorem 5.2.2. However, as well as obtaining results for mean 
values, Belyaev (1972a) also gives expressions for higher moments. Again, no 
proof is given. In Belyaev (1972b) details are given for a special case of Belyaev's 
earlier results, this being the case in which only first moments are taken, and, in 
terms of our Theorem 5.1.1, the set A is the whole of #*. This has a substantial 
simplifying effect. The proof given there is essentially that of the ‘approximation 
method' we have used, and, indeed, our proof is based on Belyaev's. However, 
the proofs of Belyaev (1972b) contain at least one mistake, for it is assumed that 
the mean value theorem for #%, our Lemma 5.24, holds with u! = ... = u, 
and this need not necessarily be true. Furthermore, the proofs seem to manage 
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without recourse to a fixed point theorem, seemingly leaving a gap in the argu- 
ment. Nevertheless, despite these difficulties, the general method of proof is 
correct and Belyaev's development of this method was an important and 
substantial contribution. 


5.3 AN EVALUATION OF THE MEAN VALUE OF y 


Before we can commence obtaining a neat expression for the involved integral 
we developed in the preceding section for E(x(A,)) we shall need to develop 
some general results about the moments of Gaussian variates. The first of these 
is as follows. 


Lemma 5.3.1 


Let Yı, Y;,..., Y, be a set of real-valued random variables having a joint 
Gaussian distribution and zero means. Then for any integer m 


(5.3.1) E(Y,Y; +++ Yomai} = 0 
(5.3.2) E(Y, Y: Yom} = DELY; Yin} +++ E{ Yim: Yiomts 


where the sum is taken over the (2m)! /m! 2" different ways of grouping Y,,..., Yom 
into m pairs. 


Note that this result continues to hold even if some of the Y; are identical; 
ie. we can use this lemma to compute joint moments of the form E(Y* ... Yi). 


Proof 


We follow Lin's (1967) proof of this result. From (1.6.4) we have that the 
joint characteristic function of the Y; can be written as 


(5.3.3) y(u) = elef: $ va) = exp[Q], 
where 
Q= Ow) = EY YwEQ Yu. 


Following our usual convention of denoting partial derivatives by subscripting, 
we have 


Q=- DEY Yili, 
(5.3.4) Qu = —E{Y,;¥,}, 


Qu; - O all Lk, j. 
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Successive differentiations of (5.3.3) yield 


y; = VO; 
Wig = WQ; + VO, 
(5.3.5) Vy; = Wn Qj) + Wi Qy + ViQi. 
V1.2 en T Wi, 2 (F-DU+t le .4Q; + STA *Fn- 94; 


where, in the last equation, the sequence r,,...,r, does not include the two 
numbers k and j. 

The moments of various orders can now be obtained by setting u; = --- 
u, = 0 in the equations of (5.3.5). Since from (5.3.4) we have Q (0) = 0 for all j 
the last (and most general) equation in (5.3.5) thus leads to 


E(Yy-- Yj = AES UY JEQ Xj. 


rn-2 


From this relationship and the fact that the Y, all have zero mean it is easy to 
deduce the validity of (5.3.1) and (5.3.2). It remains only to determine exactly 
the number, M say, of terms in the summation (5.3.2). We note first that the 
total permutations of Y,,..., Y;,, are (2m)!, and, since the sum does not include 
identical terms, M « (2m)! Secondly, for each term in the sum, permutations of 
the m factors result in identical ways of breaking up the 2m elements. Thirdly, 
since E(Y; ,) = EY, Y;}, an interchange of the order in such a pair does not 
yield a new pair. Thus 


M(m!)(2") = (2m)! 
implying 


(2m)! 


M= 
m!2™ 


as stated in the lemma. 


For the next lemma we need some notation. Let Ay be a symmetric N x N 
matrix with elements A;;, such that each A;; is a normal variate with arbitrary 
variance but such that the following relationship holds: 


(5.3.6) E{A; A4) = 6i, j, k, D) — by, 


where & is a symmetric function of i,j,k, land 6,, is the Kronecker delta. Write 
[Ax] for the determinant of Ay. Then the following i is truc. 
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Lemma 5.3.2 


Let m be a positive integer. Then under the above conditions 


(53.7) E(|Ao, 411) = 0, 

—1)"Qm)! 
(5.3.8) E(lAg.) = C RD. 
Proof 


Relation (5.3.7) is immediate from (5.3.1). Now 


[Azm] = X M(p)Ari, +++ Azm, izm 
P 


where p = (ii, in,.-., i2_) is a permutation of (1, 2,..., 2m), P is the set of the 
(2m)! such permutations, and (p) equals + 1 or — 1 depending on the order of 
the permutation p. Thus by (5.3.2) we have 


E(|A;,l] = 23 n(p) > E{Ay;, A31] TRU EÍA2 1,6, i 2m, TS 
P Q 


where Q is the set of the (2m)! /m!2" ways of grouping (i,, i2, .. .  i2,,) into pairs 
without regard to order, keeping them paired with the first index. Thus, by 
(5.3.6), 


E{|A2,[} a X n(p) ya té(L, ly 2, i;) Es 01;051,] DÀ {£ (2m Ex l, lm- 1: 2m, ijs) 
P Q 


E Ó5 - l,i2m- 92m, izm 


It is easily seen that all products involving at least one & term will cancel out 
because of their symmetry property. Hence 


E{|A,,,|} = Y n(p) X (— 1)"61;, 593, ieee S 2m, izm 
P Q 


_ (7 D"m)! 
~ — (my o^ 


as required. 


Corollary 


Let Dy = Ay — xIwhere Lis the unit matrix. Then 
(N/2] N 
(5.3.9) E{|Dyl} = (C DÈ (- vs Jun 
j-0 


where a; = (2j) !/j127, and [a] denotes the integer part of a. 
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Proof 


The determinant of Dy is obviously a polynomial in x. A little thought 
shows that |D,| can in fact be written as an alternating sum of powers of x, 
where the coefficient of the jth power is the sum of the (7) principal minors of 
order N — jin|Ay|,j = 0,..., N.(Fordetails of this expansion see, for example, 
Aitken, 1946, pp. 87-88.) Combining this expansion with (5.3.7) and (5.3.8) 
automatically yields the required result. 


It is interesting to note that neither the above lemma, nor its corollary, seem 
to depend explicitly on the & function, or the actual variances of the A;;. There is, 
however, an implicit dependence, for the condition (5.3.6) itself places severe 
restrictions on these variables. 

The final lemma we require is as follows. 


Lemma 5.3.3 


Let X be a Gaussian variate with mean u and variance o°. Let d(u) denote the 
distribution function of a standard normal variate. Then if x* = max(0, x) 


" E H d we 
E(X = [1 of | * ml fh 


Proof 
Note that 


v 2 
E(X*) = ml. xexp| - ex dx. 
Then substitute and integrate by parts. 


This result completes the collection of tools we shall need to evaluate the 
integral giving the mean value of the DT characteristic of excursion sets. We 
can now establish the following theorem. 


Theorem 5.3.1 


Let X(t) be a zero-mean, homogeneous Gaussian random field on @™ and S a 
subset of Jt", and let both satisfy the conditions of Theorem 5.2.2. Then the mean 
value of the DT characteristic of the excursion set A — A,(X, S) is given by 


2 
(5.3.10) E(x(4)) = ASAT Nt 9/21 A|!U/25-GN-D es (- ij 


LN - 19/2) N-1 
LIW qi N-i 2j 
x on (~lya;o ( 3j )u A 


I 
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where a? = E{X?(t)}, A is the covariance matrix of the X (t), a; = (2j)! /j!24, and 
we set (5) = O ifj >N. 


Before we prove this result, it is instructive to see what (5.3.10) looks like in 
special cases. Setting S = I, and N = 1 the formula reduces to 


0.1 1 f(/A,M? u? 

H = = — |— ee 
E{x(e € [0, 1]: X() =o} = = (3) ew = ) 
where 4, = E{|dX(t)/dt|?}. This is exactly the formula given in Theorem 4.1.1 
for the mean number of upcrossings of a Gaussian process on [0, 1]. Such a 
result is of course to be expected, since the DT characteristic reduces to this 
variable in one dimension. 

When N = 2 and S is the unit square we have 


2 
E(x(t e L: X(t) 2 u)} = Qn) ??|A|!2073u ew: 


while if N = 3and S = I, the mean value of y is given by 


» » u? 
(2n)~?|A[!/2a75 ew - x.) (u? — 0°). 
In general, the mean value of x is a polynomial in u with terms of order (N — 1), 
(N — 3), (N — 5),..., multiplied by a negative exponential in u? and some 
dimension-dependent constants. We shall have more to say on the importance 
of this form in the following chapter. 


Proof 


We now turn our hand to proving the theorem. By Theorem 5.2.2 we need 
only evaluate 


AS)-10'! f xp (det d)ó(u, 0,..., xy, Z) dxy dz. 
xn 204 ANO —1)2 
To do so, we firstly perform an orthogonal transformation ofthe field coordinates 
in such a way that the first (N — 1) partial derivatives of the field become 
uncorrelated. A careful choice of transformation will not, by Lemma 4.5.1, 
change the expected value of the DT characteristic. This is done as follows. 
Recall 


A = (Ap, Ai = E{x; Xj}, for ij= Tepes NG 
Let A be the (N — 1) x (N — 1) submatrix of A formed by taking the first 


(N — 1) rows and columns of A. Let Q be an orthogonal matrix which reduces 
A to diagonal form; i.e. 


(5.3.11) QTAQ = diag(A,..... Ay 1) 
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for some À,,..., Ay. i. Since À is positive definite the A, are all positive. Form a 
larger matrix Q from Q by setting q; = Qi; 5 j = L..., N — 1, diy = Qui = O, 
i= 1,..., N — 1l, qyy = 1;ie. 


5 or 
(5.3.12) e-(5 1) 


We can now transform the coordinate system of the field X by mapping each t 
to a point 


+= Qt. 


Such a mapping trivially satisfies all the conditions of Lemma 4.5.1, so we can 
work from now on in the t coordinates. However, if X(t) is the random field 
defined by 


X(t) = X(Q^ 4), 


and V X and VX are, respectively, the vectors of the first-order partial derivatives 
0X/0t; and 0X/dt, then 


ie. V£ — VXQ. It follows that 
E(VÉ'VX) = Q'E(VX'VX)Q = Q"AQ. 
Thus, in view of (5.3.11) and (5.3.12), we have that 


«prot 


OT; x = Ôi jAi» for i, j = L..,N-— 1, 


where ó;; is the Knonecker delta. Thus if we write W to denote the variance- 
covariance matrix of VX we have, for some H1, ..., Hy. 1; An; 


Ay Hı 
W= t. : . 
Ån-1 HN-1 
Bic Hyn- ÀN 


We note, for future reference, that on returning to the original coordinate 
system we have, by expanding |W | and proceeding inductively, that 


N N-1 N-1 
(5.3.13) - [Al 2 (Wl = []4,- È (47 ITA]. 
jot 1 


"ha 
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Since the field X(t) is still homogeneous and Gaussian, the independence 
between its various partial derivatives discussed in Section 2.4 holds, so that we 
can write 


(5.8.14) (u,0,...,0, xy, Z) 

= bo(u)d,(0)--- dy -1(0) Ó (xy x; = 0,j < N)on+i1(2Z|x = 0) 
where the @, are all Gaussian densities in an obvious notation. It is easy to 
check from (1.6.5), (1.6.6), and (5.3.13) that 


Iw] 


N-1 7° 
j=l Àj 


E{X3(t)|X{t) 20j < N} = 


Applying this and the above decomposition of $ to the expression for 
E{x(A)} and integrating out xy (cf. Lemma 5.3.3) we obtain 


_ N-—1 -1 u?/2 
E{x(A)} = ASX- DYT 120)" 8 * a wil) exp( - m) 


x face d oy. (zlu) dz. 
(5.3.15) 


It remains to evaluate the integral in this expression. The elements in d are 
the variables £ iAt), i, j = 1, 2,..., N — 1, jointly normally distributed, and, 
according to Sections 1.6 and 24, satisfy the following relationships, for i, j = 
1,2,...,N — 1, 


OyA 


(5.3.16) ERDIRA) =u) = - 7 J" = E, say, 
(5.3.17) E([£;(*) — EX Wd) — Eal XO = u} 


Ô; Ör ATA 
= &(i, j, kl) — LERTE, 
e 
where &(i, j, k, D) is a symmetric function of (i, j, k, I). Now make the transforma- 
tion 
u 


Aij = X) US 


o 
JA, +6 
Then it is a simple consequence of (5.3.16) and (5.3.17) that 
E{A,;|X(t) = u} = 0 
and 
E{A,,Ay| X(t) = u} = MM — àjóu. 


5.4 THE MEAN VALUE OF THE IG CHARACTERISTIC WHEN N = 2 115 


Thus, conditional on X(t) = u, the A; , Satisfy the conditions of Lemma 5.3.2. 
Applying the corollary to this lemma we now easily obtain 


Í det doy, (Zlu) dz = ef T] B «eda = ay X(t) = i 


ae e(Te or" ghe" : jc 
g jz0 2j 


o 


Combining this with (5.3.15) and noting |A| = |W] suffices to complete the 
proof of the theorem. 


54 THE MEAN VALUE OF THE IG CHARACTERISTIC WHEN 
N=2 


We now specialize for a moment and restrict ourselves to two-dimensional 
random fields. For such fields we showed, in the preceding chapter, that excur- 
sion sets are well described by their IG characteristic, F, which was derived from 
integral geometric considerations. To obtain its mean value we shall use the 
relationship between y and F along with the results of the preceding section. 

From the point set characterization of F given in Theorem 4.3.2 and the 
definition of y, we have that if X is suitably regular, and A, = A,X, I,), then 


(5.4.1) T(A,) = XA) + T7 - T7, 


where 


~ 
N+ 
it 


# {x €e (0, 1]: X(1, x) = u X,(1, x) > 0, X,(1, x) > 0}, 
# {x € (0, 1]: X(0, x) = u, X,(0, x) > 0, X4(0, x) > 0}. 


It is obvious from the homogeneity of X and the definitions of r} and Fy 
that as long as their expectations are well defined, they must be identical, from 
which (5.4.1) would imply E(T(A,)) = E{y(A,)}. However, the finiteness of these 
expectations is ensured by the fact that they are bounded above by the numbers 
of level crossings of the processes X(1, x) and X(0, x), x € (0, 1], and these 
numbers have finite mean in view of the suitable regularity of X and Theorem 
4.1.1. 

Thus we have the following result, essentially a corollary of Theorem 5.3.1. 


rz 


il 


Theorem 5.4.1 

Let X(t) be a zero-mean, homogeneous Gaussian random field on R? satisfying 
the conditions of Theorem 5.3.1. Then the mean value of the IG characteristic 
of its excursion set is given, in the notation of Theorem 5.3.1, by 


2 
(5.4.2) EILA D} = Qn) PJAN Poa Au ow a 
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There are no problems that arise when the proof of Theorem 5.3.1 is used to 
obtain (5.4.1). The only apparent difficulty, i.e. that associated with justifying 
the orthogonal transformation of the field coordinates to simplify computation, 
disappears when it is noted that in the case N = 2 this transformation reduces 
to the identity transformation. 

It 1s, however, a simple matter to derive (5.4.2) without recourse to Theorem 
5.3.1 and, in so doing, derive expressions for E(T 1 ) and E(T, ) which are them- 
selves of some interest. Recall, from Theorem 4.32, that if F} (T, ) is simply the 
number of points in I, satisfying 


(5.4.3) X(t) — u= X,(0 ^ 0, X,(t) > 0, 

and X,,(t) < 0(X1,(t) > 0), then in view of the comments preceding the above 
theorem we must have 

(5.4.4) E(T(A) = E(T1) — E(T1). 

Under the conditions of the theorem it follows, as in the preceding section, that 
(5.4.5) EI} = Í Re X110(u, 0, x2, X11) dx; dxy4, 


where ¢ is the density of (X, X,, X2, X,,), while the mean value of Fy is given 
by the same expression with the second integral over the range x,, > 0. 

According to the correlation structure exhibited by the four variates appearing 
in the density @ (cf. Section 2.4) we can rewrite (5.4.5) as 


E(T 1) = QnoA12)* 'exo(- ia) PIXE |X, = O}E{X7,|X = uj 


20? 
where, as usual, x* = max(0, x), x^ = min(0, x). But now applying (1.6.5), 
(1.6.6), and Lemma 5.3.3 it is easy to check that 
(5.4.6) 


2 
E(T?) = Qn)? |A|o7! ep- x) 


u A v? — A2,/o? AA 
1 o 11 11 
» E l E] "TAG exp( Xom? — 25)! 


where v? = E(X?,) and 6 is the standard normal distribution function. 
Similar calculations, carried out on the integral expression for E(Tj }, yield 


(5.4.7) 7 
E(T1) = Qn)^??|A|c^! ew(- i) 


u Aud SI = At /0 a ( A? uz ) 
——|l-9 — ; — ET e 
n | 5| (2; P2 Ima, B Xo?v? = Àj) 
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Piecing these last two equations together as in (5.4.4) immediately yields (5.4.2), 
as claimed. 

However, as well as providing a more direct derivation of (5.4.2), these two 
equations contain interesting information in their own right. For example, 
using the well-known and easy to check approximation 


1 — Dix) = 


1 
e PTL + Oxo 1D] 
x 


(e.g. Feller, 1968, p. 193), these equations easily yield 
EITI} ~ Eff}, u large, 


while E{F 7 } is of a lower order of magnitude than both of these expectations. 
Thus, at high levels, it would seem that there are (relatively speaking) far more 
points in I, satisfying (5.4.3) and X ,(t) < 0 than this condition and X ,,(t) > 0. 
The reasons why this phenomenon should occur, and its implications, form the 
bulk of the following chapter. 


5.5 AN ESTIMATION AND INTERPOLATION PROBLEM 


In our discussion on excursion sets in @' at the beginning of the previous 
chapter, we noted that level crossings could be applied to a particular estimation 
problem. The parameter to be estimated was the square root of the second 
spectral moment, 4; , and a family of unbiased estimators was given by 


A u? 
O(u) = 2nc exp G) N, 


where N, is the number of level crossings of the level u in [0, 1]. Our interest 
in estimating A1? was its appearance in the limiting distribution of the maximum 
of a Gaussian process over a long time period (see 4.1.2). The corresponding 
result for random fields, Theorem 6.9.4, involves the entire matrix of second- 
order spectral moments, but only through the square root of its determinant. 
However, this determinant is simply the factor |A| appearing in Theorems 
5.3.1 and 5.4.1, which give the mean value of the DT and IG characteristics of 
excursion sets. Thus, it follows immediately from these results that, as long as 
the variance of the random field is known, the DT and IG characteristics, 
multiplied by known functions of the level u, can serve as unbiased estimators of 
| A|U2. 

In practical situations a difficulty often arises when the field is observed only 
as some lattice subset of #% rather than over the whole of the space. In this case 
it is not possible to determine exactly the actual excursion sets of the underlying 
field, nor their characteristics. Hence some sort of approximation procedure is 
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ie 0.5 1 


Figure 5.5.1 Approximating an excursion set on the 
lattice L4 


desirable. In this section we shall consider in detail a sequence of approximations 
to the IG characteristic in Z?, while also giving a briefindication of what happens 
in 2°. These two cases are the ones of most practical interest. 

Following our practice of advising our reader, we note that although the 
material of this section is extremely valuable for applications, and is far from 
devoid of theoretical interest, it is completely unrelated to later chapters and 
so may be passed over on the first reading. 

We restrict our attention now to a suitably regular random field defined on 
the unit square I, in 2°. For n > 1, let L, denote the lattice of the (2" + 1)? 
points in J, of the form (i-27",j- 27", i,j = 0,1,...,2", and suppose that for 
some fixed number u and some n all we know about the field X is whether its 
value at each point of L, is above or below the level u. 

Consider for a moment the example in Figure 5.5.1, in which a single closed 
level curve encloses the excursion set A,. Also in this illustration is the grid L4 
and a ‘natural’ approximation to A, obtained by joining with horizontal or 
vertical lines all neighbouring points of L, lying in A,. For general n > 1, let 
N,(S) denote the number of ‘squares’ (ie. quadruples of neighbouring L, 
points serving as the four corners of a square) in the approximation to A, 
based on L,. Strictly speaking, N,(S) is also a function of u, but we can drop this 
parameter without causing confusion. Similarly, let N,(V), N,(H), and N,(P) 
denote, respectively, the number of vertical and horizontal lines (i.e. neighbour- 
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ing pairs of L, points) and points of L, in this approximation. Then note, for the 
example of Figure 5.5.1, that we have 
N3(S) = 3, N(V)=11, — N4H)-9, N3(P) = 18, 
and 
N3(S) — EN3GI) + NX(V)] + N3(P) = 1 = T(A,). 


Relations of this form in fact hold in general, although we have to introduce a 
‘boundary correction’ term to cover the case when A, intersects one or other 
of the axes. To this end let us write N*(V) and N*(H) to denote the number of 
pairs of neighbouring points of L,, both of which lie in A, and are on the vertical 
and horizontal axes, respectively. Similarly, let N*(P) denote the number of 
points of L, on either axis lying in A,. Then we have the following result. 


Theorem 5.5.1 

Let X be a two-dimensional random field, suitably regular for the level u over 
I,. Then, with probability one, 
(5.5.1) I(A,) = lim T,(A,), 


n> 


where 
FG, = N,(S) — EN) + N,GOD] + N,(P) — ENZ(O) — N¥(V) — NCH). 


Proof 
We start by recalling that the characteristic F(4,) was originally defined to be 
(5.5.2) F(A.) = 9(A,) — e(4, n 1), 


where ¢ is the Hadwiger functional and I, consists of the two sides of J, which 
lie on the axes. If we call these two sides S, (horizontal) and S, (vertical), then the 
additivity property of o ensures 


(553) — e(A,n 1) = PAu A S1) + e(A, A S2) — GA, ^ (0)). 


Consider now the term @(A, ^ S1). Let N*(S,, P) be the number of points in 
A, L,0 S, Then with probability one N¥(S,, P) — N*(H) is non-decreasing as 
n increases, since although increasing n can increase both N*(S,, P) and 
N*(H), the second term can never be increased by more than the first. Hence 
N*(S,, P) — N*(H) converges to a limit, 9*(S, © A,) say. It is now not hard 
to see that this limit must in fact be g(A, © S,), (The details, if desired, can be 
developed by comparing the essentially equivalent result in Cramér and Lead- 
better, 1967, p. 195, which shows that the number of upcrossings of a one- 
dimensional process can be obtained via this type of approximation.) Combining 
this with the corresponding result for the other axis yields, via (5.5.3), 


(5.5.4) NI) NAH) - NK) "S (AL 1) — asno ox. 
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We now turn our attention to q(A,). For a given realization of the field, 
consider the points t € J, at which 


X(0—-u-X,0-0 or X(0-u-X,0-0 


For each of these points, and there are almost surely only finitely many, draw a 
line parallel to the t; axis, or, respectively, the t, axis, containing the point. These 
lines form another grid over J, and, with probability one, contain none of the 
points of L, for any n. Furthermore, each connected region of A, within each 
cell of this grid is a basic (see the proof of Theorem 4.3.1). Given this, it is not 
difficult to see that there exists an N z 1 such that the approximations to these 
basics, based on squares and lines with corners and ends in L,, n 7 N, are 
connected sets without ‘holes’. Thus the approximation to each basic has a 
Hadwiger characteristic of unity, as does the basic itself. It is a simple conse- 
quence of the definition of the Hadwiger characteristic (see 4.2.3) that the 
characteristic of each of these approximations is given by 


(5.5.5) NAS) — EN(W) + NOD] + NC), 


where we are, of course, only counting the squares, lines, and points wholly 
contained within each basic. 

Thus we have that for large enough n an approximation based on L, gives us 
sets with the same Hadwiger characteristic as each basic in A,. However, to 
get at g(A,) we need to know what happens when we put the basics together 
to make up A, again. To do this we simply apply the defining formula (4.2.3) 
of ọ once again, which, when combining with (5.5.3), yields 


PCA) = N,(S) — [N,(V) + NGD] + NO) 


for large enough n. This result, combined with (5.5.2) and (5.5.5), is sufficient to 
establish the theorem. 


As an aside, it is interesting to query whether or not this approximation pro- 
cedure can be used to replace either of the other two that were developed for the 
evaluation of the mean value of excursion characteristics. In fact, in one di- 
mension when we need only deal with the number of level crossings, it is an 
approximation of essentially this form that yields results with minimal assump- 
tions. Our experience, however, has shown that this may not be the case in 
higher dimensions, for essentially two reasons. The first is that the above 
approximation argument is difficult to carry over beyond the two-dimensional 
case in a rigorous fashion. The second problem, which arises even in the two- 
dimensional setting, is that there is considerable difficulty in showing that the 
expectations of the approximating I, converge to the appropriate limit. Never- 
theless, it may well be a worthwhile task to procced via this approximation and 
perhaps obtain a version of Theorem 5.4.1 under weaker conditions. 
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Finally, let us consider, without going into detail, what Theorem 5.5.1 would 
look like in three dimensions. Since once again we would have 


T(A,) = P(A) Ez P(A, A 1) 


and Theorem 5.5.1 tells us how to approximate g(A, ^ 1,) (since A, n 1, is 
made up of three two-dimensional components), we need only approximate 
E(u). Let L, be the lattice of points in J, of the form (i- 27", j- 27", k-27”, 
i,j,k = 0,..., 2". Let 


N,(C) = number of cubes (sets of eight neighbouring points in L, which act as 


the vertices of a cube) at all of whose vertices X > u, 


N,(S) = number of squares (sets of four neighbouring points in L, which act 
as the vertices of a square) at all of whose corners X > u, 


N,(L) = number of lines (neighbouring pairs of points of L,) for which X > u, 
N,(P) = number of points in L, at which X > u. 


Then, following along the lines of the proof of the preceding theorem, one can 
establish 
P(A) = lim [N,(P) — NL) + N,(S) — N,(C)] 
with probability one. 
The actual, detailed, proof of this result is too long and unenlightening to be 
worth giving here. 


CHAPTER 6 


Local Maxima and High-Level Excursions 


We now turn our attention away from the direct study of excursion sets and 
their characteristics to redirect it towards a study of the local maxima of random 
field sample paths. The motivation for doing this comes primarily from practical 
situations, where maxima often have an important role to play. In Chapter 1, 
for example, we gave a brief description of the random field model of rough 
surfaces and noted that if two such surfaces are placed in contact then actual 
contact between the surfaces only occurs when the ‘hills’ of one surface adjoin 
the ‘hills’ of the other. Of course, these ‘hills’ are simply the local maxima of the 
two surfaces, so it is clear that before any statistical study of contact-related 
phenomena such as friction can be undertaken, a knowledge of the statistical 
behaviour of maxima is needed. The derivation of the exact form of this behaviour 
forms the bulk of the content of this chapter. 

We commence in the first section by obtaining an expression for the mean 
number of local maxima of a Gaussian field. As in our evaluation of the mean 
value of excursion characteristics the first step is to obtain an awkward integral 
expression for the mean value we now require. However, unlike the excursion 
characteristic case, this integral does not easily yield to simplification to give a 
simple closed formula in other than one very special case. This is the case when 
the field is two dimensional and isotropic, so we investigate this case in some 
detail, essentially retelling the work of Longuet- Higgins (1956). 

Despite these difficulties, however, it turns out to be possible to obtain an 
asymptotic expression for the mean number of maxima above an arbitrarily 
high level, which we proceed to do in Section 6.3. The result that arises is rather 
strange, for it turns out that, under standard conditions, the mean number of 
maxima of a Gaussian field above the level u is asymptotically the same as the 
mean value of the DT characteristic of the excursion set at the level u. This 
result seems to imply the possible existence of a close relationship between 
high-level excursion sets and local maxima, with further significant implications 
about the shape of sample functions at high levels. This problem is discussed 
in some detail in the remainder of the chapter, where we cover the work of 
Lindgren (1972), Nosko (19694, 1969b, 1970), and Adler (1976b, 1977a). 
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Finally, we note here that the bulk of this chapter is concerned with the local 
maxima of random fields only. Problems related to global maxima are taken up 
only in the final section, and then only in considerably less detail. 


6.1 THE MEAN NUMBER OF MAXIMA 


Throughout this section and, in fact, the remainder of this chapter, we shall 
assume X(t) is a homogeneous, Gaussian, N-dimensional random field and S 
a compact subset of 4" whose boundary has zero Lebesgue measure. We shall 
start by obtaining an expression for the mean number of local maxima of X 
above the level u in S. Writing X; and X;; for the usual partial derivatives of X, 
and defining 


X' = (X, ..., Xy) X" = (Xy. fori,j- L...,N, 


it follows that X will have a local maximum above the level u at the point te S 
if the following conditions are satisfied: 


X(0—0,  ÀX(02w 
X"(t) is a negative definite matrix. 


Written in this form it is clear that the number of local maxima will, under 
appropriate regularity conditions, fall into the general framework of Section 5.1, 
so that the ‘corollary’ to the theorem there tells us that 


(6.1.1) E{M,{S)} = X(S) II [det x"(t)| h(x, 0, x") dx dx”, 


where M,(S) is the number of local maxima of X in S lying above the level u, 
$(x, x’, x") is the joint density of (X, X’, X^), and the second integral is over the 
region in ZYN +12 in which x"(t) is negative definite. (For the purposes of this 
formula X" is also to be considered as an N(N + 1)/2 vector.) 

The 'corollary from which (6.1.1) follows is vague about the regularity 
conditions required for this result to hold, and to carefully derive this result 
in full rigour essentially requires a repetition of the arguments used to obtain 
the rigorous derivation of E{y(A)} of the preceding chapter. Since there seems 
to be little to be gained from such detailed repetition, we shall content ourselves 
with simply a statement of the following result and leave it to the reader, if he so 
wishes, to convince himself that the details of its proof are as we have indicated. 


Theorem 6.1.1 


Let. X(t) be a zero-mean, homogeneous Gaussian field on S. Assume that X 
has almost surely continuous partial derivatives of up to second order with finite 
rariances in an open neighbourhood of S and that the joint distribution of X and 
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these partial derivatives is non-degenerate. Furthermore, let the moduli of con- 
tinuity of the X ;; satisfy the following condition for any £ > 0: 


(6.1.2) P\max eh) > e} =0o(h)  ash|0. 


LJ 


Then, in the above notation, 
(6.1.3) E{M,(S)} = A(S) Í fidet x"(t)| d(x, 0, x”) dx dx”, 


where the second integral is over the region in AYN * Y? in which x"(t) is negative 
9 g g 
definite. 


The integral in (6.1.3) looks very similar to that yielding the mean value of the 
DT excursion characteristic, and one is thus led to hope that it might yield to a 
similar mode of attack. However, the awkwardness of the domain of the second 
integral makes it far more complicated, and a simple expression for (6.1.3) is 
known only in the two-dimensional case, and then only when u = — œ. 

To derive this expression we need the following lemma. Recall that a point 
t e A` is called a critical point of X if X(t) = 0 for all i = 1, ..., N. We write 
C,(S) for the number of critical points of X in S above the level u, and M and C 
for the total number of local maxima and critical points, respectively, of X 
in I,. 


Lemma 6.1.1 


If X is a two-dimensional random field satisfying the conditions of the pre- 
ceding theorem then 


(6.1.4) E{C} = 4E{M}. 


Proof 


Since X is homogeneous it is clear that if C(r), M(r), S(r), respectively, denote 
the number of critical points, maxima, and saddle points of X within the 
square D(r) = (te 22:0 < t, t; < r} then 


(6.1.5) E(C() = PE(C(1) = P E(C), 


while identical relationships hold for maxima and saddle points. 
Consider now the IG characteristic of the set A(X, D(r)); i.e. the excursion 
set of X above zero. By Theorem 54.1, 


(6.1.6) E{T(A((X, D(r))) = 0. 
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Note that the excursion set Ag( — X, D(r)) is simply the closure of the comple- 
ment of A,(X, D(r)) in D(r), and since this set obviously has the same distribu- 
tional properties as A(X, D(r)) it also has a mean IG characteristic of zero. 

If we now write A, as the disjoint union of those components of A, which do 
not intersect @D(r) and those which do, denoted respectively by A* and A, we 
have 


(6.1.7) E{T (A(X, D(r)] = E{T(4*) + E{P(A,)}. 
Applying the relationship between the IG and DT characteristics discussed in 
Section 4.5, it follows from Theorem 4.4.1, by setting Z = A* and f = X, that 
(6.1.8) T(4*) = # {maxima and minima of X in D(r) above 0} 

— # (saddle points of X in D(r) above 0}. 

Furthermore, for large r, the number of components in A, is clearly of the 
same order as the number of zero crossings of X(t) on @D(r). This, by Theorem 
4.1.1, is of order O(r). Since this argument also holds for Ag( — X, D(r)) we have, 
combining this with (6.1.6) to (6.1.8), that 

O = E(I(AG(X, D())) + E{T(Ao(— X, DG) 
= 2EÍM(r)) — E(S()) + O(r) 
since the mean number of maxima and minima must be identical. Using (6.1.5) 
for M(r) and S(r) and letting r > oo yields 


E(S(1)) = 2E{M(1)}. 


But since C(1) = #{maxima, minima, and saddle points in D(1)} this implies 
E(C(1)) = 4E{M(1)}, which proves the lemma. 


We now commence our evaluation of E{M}, following the original analysis of 
Longuet-Higgins (1957). By setting u = — oo and integrating out x in (6.1.3) 
we obtain 


E{M} = Í (x11X22 — x12)0(0, 0, X11, X12, X22) dx11 dx22 dx22, 
R 


where R is the region in 2° over which x,, < 0 and x,,x5; — x2; > 0, and ó 
now denotes the joint density of (X1, X2, X11, X12, X22). 

We showed in Chapter 2 (cf. 2.4.10) that second- and third-order derivatives 
of homogeneous fields are uncorrelated. Thus, since X is Gaussian, $ factors 
into the product of two densities and we have 
(6.1.9) 


E{M} = Qn|A|7) ! Im — xi13$Gi i. X12. X22) dx dxi 2 dx22, 
` R 


where A is the usual matrix of second-order spectral moments: ie. = E(X;X jl. 
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Let M be the covariance matrix of (X11, X12, X22). Then if mj; denotes the 


spectral moment ff 444) dF(A), i + j = 4, it follows from Section 2.4 that 
Mag M3, M22 
M =| m3, mj; mJ 
M22 mis mo, 


Thus M is composed of a mixture of fourth-order spectral moments. Let T be 
the matrix given by 


0 0 1 
T=ļ0 -1 0 
4 0 0 


Let A bean orthogonal matrix, with A* the diagonal matrix (AATY !, for which 
(6.1.10) ATMA = I, ATA*TA*A = D, 


where I is the unit matrix and D is a diagonal matrix with elements d,, d2, d3. 
Since both M and T are symmetric and M is positive definite, such a matrix A 
can always be found. Transform X" = (X ,,, X12, X22) via Ato get new variables 
Y = (Y, Y», Y3) given by 


Y-2X".A. 


Then the Y, are clearly independent, standard normal variates. Furthermore, 
since 


X,1,X5; — Xi, = XTX 
we have 


(6.1.11) X11X22. — X15» 0 YDY" > 0 
«d,Yi + diY2 + d3 Y3 » 0. 
The d; are easily found, for it follows from (6.1.10) that 
A'MT = DA? 


from which we have that the d; are the eigenvalues of MT. Hence they satisfy 
the characteristic equation det(MT — dI) = 0; ie. 


1 1 
2m»; -d —mM3, 2M40 

1 1 

3M 13 —mj;,—d 3ms =0 
1 1 

2mo4 —mis 3M. —d 


On expanding the determinant we find 


(6.1.12) 4P — 3Hd — |M| = 0 
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where 

(6.1.13) 3H = m4o9mo4 — 4ms,m,4 + 3m,. 
Hence 

(6.1.14) d; + d, + d3 = 0 

and 

(6.1.15) did5d4 =%|M| > 0. 


It follows that one of the eigenvalues, d, say, is positive while the other two, 
d,, d3 say, are negative. We write 


d; > 0 >d, 2> dz. 
We now claim that E{M} is given by 
(6.1.16) E{M} = Qn) ??|A|"!?T'(d,, do, d3) 
where 


I'(d,, dy, d3) = Í exp[-4(y? + y3 + y3)ydiyt + doy$ + d3 y3) dy, dyz dys 
» 


and D' is the conical region where 
ERU dıy? + d2y3 + d3y3 > 0. 


To prove this claim we first note that the same proof that is used for Theorem 
6.1.1 can be used to prove 


E(C) = Qz|A[2)7! Í Axa 7 Fal BGs X12» X22) dii i2 dxaz» 
R 


in the notation of (6.1.9). Under the transformation Y = X”A the integral 
in this expression is clearly equal to (2m) ??I(d,, d2, d3) where 


I(d,, d2, d3) = Í expL- $61 + y$ + y3X]ldiyi + dayi + daysldy: dyz dys. 
a 


By breaking up this integral into four sections depending on whether the 
expression within the modulus signs is positive or negative, and on the sign of 
yp it is straightforward to check that 


4l—-I- i expL- 361 + y3 + y3Ydiyi + d2y3 + d3y3) dy, dy; dys 
a 


= Qx)*(d, +d, + d3), 


which vanishes, by (6.1.14). Thus /' = 1I. But by the preceding lemma E{M} = 
4E{C}, so by applying this to the above facts our claim of the veracity of (6.1.16) 
is established. 
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To complete this problem we need to evaluate the integral I’. For this we use 
the substitution 


yy = dy "^r, 
y; = (7d) !"rsin 0 cos v, 
y3 = (—d3) '?r sin 0 sin y, 
0O«cr«o,0«0«in0cwy-c2mn. 
We then have 


2 
I' = (d,d,d3)~ E arf dà ay ZE (1 +f sin? 0) arem 0 sin 0, 
1 


-jse (oo y- (sin " 


Integrating (by parts) with respect to r gives 
di cos? 0 sin 0 
‘= 3n)? —— do| d . 

PSIGE Gay? i K VOF F sim? 0j? 
Further integration with respect to 0 (by transforming to x = sin 0) gives 
di 2 
—— 7 d med zl. 
ai), "bp ne 
This rather forbidding looking integral can actually be evaluated, although 
not in terms of elementary functions. Using methods dating back to Legendre 
(1811) or, alternatively, turning to a more modern treatise listing tables of these 


integrals (e.g. Byrd and Friedman, 1971), it is possible to find, after considerable 
algebra of the kind modern mathematicians rarely tackle, that 


d n d, 1/2 ^ d 1/2 i 
r= masa (274) E(k, 27) E " = i) F(k, | 


— (d, +d, + d)F(C, 0) E(k, 4x) + E(k’, 0)F(k, 3x) 


where 


I Z 4(1n)!? 


- F(k’, OF(k, $) — ih 
Here E and F are the Legendre elliptic integrals of the first and second kind: 


6 
E(k, 0) = [a — k? sin? a)? da, 
0 


0 
F(k, 0) = Í (1 — k? sin? a) 12 da, 
0 
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and 


k? = T 7 2. k?=1- k, 0-— av (- a 
d, — d 


If we now make use of (6.1.14) and (6.1.16) we finally obtain 


_ 1 (dad)? | (d - VP, da S 
E{M} — 205? JA |1? d, E(k, 3n) d, m di F(k, 3n) t 
(6.1.17) 


To actually evaluate this expression we need to know values of d,, d,, and 
d5. It is straightforward to check that setting 


(6.1.18) d; = H"? cos 0j, 


where 04, 05, 0, are the roots of 


IMI 

PE 

will in fact produce d;'s satisfying the defining equation (6.1.12). Finally, on 
defining a function G(a) by 


cos 30 — 


(61.19) G9 - px — op? (59V Ea, dy — (< Fi an], 
l+a 


[»4 
where 
1-2 
P=, 0<a<}, 


we find we have proven the following theorem. 


Theorem 6.1.2 


Under the conditions of the preceding theorem the mean number of maxima of a 
two-dimensional field in the unit square is given by 


1 d d 
6.1.20 E{M} = 5 G|- 4]. 
(6.1.20) M = xix ( 5 
The mean number of minima is given by the same expression, while the mean number 
of saddle points and critical points are, respectively, two and four times this ex- 
pression. 


The form of G(a) is shown in Figure 6.1.1. When « > 0, k? > 1 and simple 
estimates show that F(k, 37) -+ o logarithmically. Hence aF(k, $2) > 0 and 


(61.21) — — lim G(x) = 1. 


x of} 
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1.00 


0.96 


Ge) 


0.92 


0 0.1 0.2 0.3 0.4 0.5 


Figure 6.1.1 Graph of G(a) 


When a = 4, k? = 0 and so 


(6.1.22) GG) = Qi? — icu zt =0907.... 


2/3 

Throughout the whole of its range G departs very little from unity. 

In the following section we shall have a closer look at (6.1.20) for the special 
case of an isotropic field, in which E{M} can be expressed in a simpler form. 

The above theorem, while useful and interesting, does not really solve the 
problem we set at the beginning of this section; i.e. to obtain a useful expression 
for the mean number of maxima of an N-dimensional field above a level u 
other than u = — œ. We can see from the derivation of this result, however, 
that a solution to the general problem would not be easy to obtain. In fact, in 
spite of the obvious importance of this problem from the point of view of 
applications, it has defied solution for over two decades. In the third section of 
this chapter we shall return to this problem to show that a partial solution, at 
least, is possible. There we shall derive an asymptotic (in u) formula for the mean 
number of maxima above a high level u. 


62 THE ISOTROPIC CASE 


The formulae of the preceding section simplify considerably when the under- 
lying random field is isotropic as well as homogeneous. In this case many of the 
spectral moments vanish, and simple relationships exist between the remaining 
ones. Since the formula for the mean number of maxima was an involved 
function of these moments, this has a significant simplifying effect. 


6.2 THE ISOTROPIC CASE 131 


Recall that a random field was said to be isotropic if its covariance function, 
R(t), was simply a function of ||t||. In this case R is clearly also a (different) 
function of |||]? = (t? + 13), and we define this function, R say, by 


R(t) = R(t?) 
so that 
E{X(ty, t2)X(0, O} = R(t) = Ree? + d). 


The various spectral moments of X are easily obtained as derivatives of R 
at the origin. For example, we have 
OR(t) 


1 


= W ROE + 12) 


where the superscript denotes the order of derivative of R. Hence 


R(t) 


Ab 2RO( + 3) + Au RO + 0). 


Thus, denoting second-order spectral moments as usual, we have 


Ay = ad uc —2R"X0) 
Similarly, 
Aga = ~2R(0) 
and 


Ay2 = E{X,()X2(0} = 0. 


Furthermore, proceeding in this fashion it is easy to check that, in the notation 
of the preceding section, 
Mao = Moa = 12R(0) 
M2 FE 4R(0). 
Finally, using either the spherical symmetry properties of the spectral distribu- 


tion function (see Theorem 2.5.2 and the discussion preceding it) or differentia- 
tion as above, it follows that for the odd moments we have 


mia = Hai, = 0. 


Thus, if we introduce two new parameters to denote the variances of the 
first- and second-order partial derivatives of X by setting 


(6.2.1) vy, EEN POD}, v; c ELX Zt 
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we can write all the second- and fourth-order spectral moments in terms of these 
two parameters: viz. 


(6.2.2) Ay = Ax = LU Mao s moa4 = V2, 
EE = = = 
M22 = 3V2, Aiz = m3 = ms, =Q. 


On substituting these equations into equation (6.1.12) of the preceding 
section, it is easy to see that the eigenvalues d; that determine the mean number 
of maxima of X are the solutions of 


27d? — 9v3d — 2v3 =0. 
By (6.1.18) the three solutions to this are given by 


2v, =v, —v 
(6.2.3) (d, dz, d3) = (s $ 3 2 , 3 


We can now substitute directly into (6.1.20) to obtain, with the help of (6.1.22), 
the following theorem. 


Theorem 6.2.1 
If the conditions of Theorem 6.1.2 hold and, furthermore, X is isotropic, then 


1 v 


6.2.4 = se 
| M 614/391 


where v, and v, are the variances of the first- and second-order partial derivatives 
of X, as defined by (6.2.1). 


An interesting observation which will always be true, but which is most 
transparent in the isotropic case in view of (6.2.4), is that the mean number of 
local maxima is independent of the variance of the field itself. This is, of course, 
to be expected, since a change of scale, and thus of variance, will not affect the 
numbers of the various critical points of any realization. 


63 THE MEAN NUMBER OF MAXIMA ABOVE HIGH LEVELS 


As we have already noted, the derivation of a simple expression for the mean 
number of maxima above a level u > — oo has never been successfully achieved. 
It is possible, however, to obtain a reasonably simple asymptotic expression for 
this quantity when the level vis high. This is given in the following result. 
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Theorem 6.3.1 


Let X be a Gaussian field as in Theorem 6.1.1 and let M (S) denote the number 
of local maxima of X above the level u in S. Then 


A(S)| A |"? N-1 2 1 
31) EMO) = eee em - s) * 9] 


where o? is the variance of X and A is the usual matrix of second-order spectral 
moments. 


To prove this result we require the following lemma. 


Lemma 6.3.1 


Let V be an N x N symmetric matrix and let Y the vector of length K = 
N(N + 1)/2 formed by placing the successive columns on and above the main 
diagonal of V under one another. Let A be a diagonal matrix with positive values 
on the diagonal and let D, be the region in &* defined by 


Y € D, © V — uA is negative definite. 


Then if o(r) = (y e RE: |ly|| < r} we have that there exists a finite C dependent 
only on A such that, for each r and u > Cr, 


olr) c D,. 


Proof 


Let P be an orthogonal matrix such that P" AP is the identity matrix I, and Q 
an orthonormal matrix reducing P7VP to diagonal form, i.e. 


Q"PTVPQ = diag(d,,..., dy). 
Then 
(PQ)'(V — uA)(PQ) = diag(d, — u,...,dy — u). 
Since the expression on the left-hand side will be a negative definite matrix if 
and only if V — uA is, it follows that V — uA will be negative definite provided 


u > max;d,. 
Note that if we write PQ — A, then A is an orthogonal matrix and 


2 È asvyas 
j k 
1/2 
< sup "(E aj, 2, 2 
kj k 


d 


|d;| = 


C sup [rul 
ki 
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where C is a finite constant depending only on the elements of A. Thus ifu > rC 
and Y e o(r), we have that u > C sup|v,;| so that u > max; d;, implying that 
V — uA is negative definite. That is, o(r) = D, for all u large enough, which 
proves the lemma. 


Proof of Theorem 6.3.1 


We now turn to the proof of Theorem 6.3.1. According to Theorem 6.1.1 
we need only evaluate 


(6.3.2) AS) Í flee x" |ó(x, 0, x”) dx dx”, 


where the second integral is over the region in ZF, K = N(N + 1)/2, in which 
x" is negative definite. Using the usual dependence relationships between X and 
its various partial derivatives (cf. 2.4.9 and 2.4.10) we have that (6.3.2) is equal to 


(6.3.3) A(S)Qn)- "? Ar [609 dx fidet x"|d(x"|x) dx”. 


Furthermore, let us assume for simplicity that the various first-order partial 
derivatives are uncorrelated. If this is not the case naturally, it can always be 
achieved by an appropriate orthogonal transformation of the parameter space 
(cf. Section 5.3) without affecting the mean number of maxima. Then A is a 
diagonal matrix, with elements 4,,..., Ay say, and |A| = å; --- Ay. Since all the 
variables have a joint Gaussian distribution the only effect of conditioning the 
X; of X” on X = x will be to shift the mean of the X ;; by an amount —xA,/o? 
and to change the covariance matrix in a way independent of x. Thus if we make 
the change of variables 


where ójjis the Kronecker delta, we can express the integrals of (6.3.3) as follows: 
(6.3.4) (— yf (x) ax det(v — xo ^ ?A)ó(v) dv, 
u Du 


where D, is the region over which v — xo^ ?A is negative definite and $(v) is a 
zero-mean Gaussian density independent of u. Consider the inner integral. We 
can write 


N 
det(v — xo ?A) = Y b,(v)x* = B(x), say, 
k 0 
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where B(x) is a polynomial in x and the b, are functions of the v; j. In particular 
by(¥), the coefficient of x", is (—1)%a~?%|A|. Thus we have 


> 


N 
det(v — xo ?A)ó(v) dv = Y ex 
k=0 


Du 


where 


Ch = b,{v)(v) dv. 


For large enough x we have, by Lemma 6.3.1, that, for any r, 


bio) dv < e x Í b,(v)@(v) dv. 
RK 


a(r) 


From this, it is straightforward to check that 


1 
ETE = o) < [eyl € a? Al. 


On substituting the above back into (6.3.3) and noting that each c, is finite, we 
find that (6.3.4) is equal to 


(6.3.5) a ?"|A| IL + o(=) eco dx. 
, x 


The asymptotic formula 


(6.3.6) Qno?) ("x ex NUN Lu ey u? 
a : P\ ~ 962 m) RL 3 


is easily established by integration by parts. Combining this with (6.3.3) to 
(6.3.5) yields 


A(S) A|2uN71 u? 1 
E{M,{(S)} = Qn Dagan P -3 1+0 z 
which establishes the theorem. 


The above formula has substantial practical significance, for a large number of 
applications of random fields are intimately involved with high-level maxima. 
It is interesting to note that we have already derived this rather involved ex- 
pression, albeit in a different guise. A quick check of the mean value of the DT 
characteristic of the excursion set A, of a Gaussian field (Theorem 5.3.1) shows 
that 


l 
(6.3.7) KAAN. SD = EM dS) - | ! o[ Ji 
rr 
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This result is not just a chance coincidence and in the following sections we shall 
investigate carefully why such a relationship should hold. 


6.4 EXCURSION CHARACTERISTICS ABOVE HIGH LEVELS 


Our first step towards investigating the reason underlying the asymptotic 
equivalence of the mean values of M,(S) and x( A,(X, S)) involves studying what 
happens to excursion characteristics as the level u becomes high. Throughout 
this section, except where stated otherwise, we shall assume that X (t)is a Gaussian 
random field satisfying the conditions of Theorem 5.2.2, which permitted an 
explicit evaluation of E{y(A,)}. The set S is assumed to be compact and convex. 
We shall start with the following theorem. 


Theorem 6.4.1 


Under the above conditions 


E{yw- VAX, S) = 1 


4.1 li 
(64) — lm EAX, S) 


(642) Im EGG S) 


where the y, are as defined in Definition 4.4.1. 


0, fork =0,1,...,N — 2. 


Proof 


Recall that x = (—1)%~! 32:3 (— D)*x, and yy_1(4,(X, S)) is the number of 
points in S at which 


X(t — u = X(t) =- — Xy ..(0-0 


and the matrix D = (X;;(t), Lj = L..., N — 1,has N — I negative eigenvalues 
(i.e. is negative definite). Thus the content-of this theorem is that the mean value 
of the DT characteristic is asymptotically equivalent to the mean number of 
points satisfying the above conditions. Thus, by (6.3.5), the mean number of 
such points is also asymptotically equivalent to the mean number of local 
maxima of X above the level u. This, of course, raises the possibility of there 
being a one-one relationship between these points and maxima, a problem that 
we shall consider in detail in the following section. For the moment, however, 
we shall content ourselves with less detailed, but more easily obtainable, results. 

To prove the theorem we note that by combining Lemmas 5.2.1 and 5.2.2 we 
have, in the notation of those lemmas, 


Ely} = ASX — 1 [stet d)ó(u. 0,.... xy. Z) dxy dz 
aN O 
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where the second integral is over all z e ANN- !? for which the matrix d has k 
negative eigenvalues. The same expression, but with the second integral over all 
z and ( — 1)* replaced by ( — 1)" !, gives E(y) (cf. Theorem 5.2.2). 

To simplify the expression we note that, as in Section 5.3, we can transform the 
coordinate system in such a way that the first N — 1 first-order partial deriva- 
tives are uncorrelated, and neither the mean value of y nor that of the y, are 
affected (cf. Lemma 4.5.1). Assuming that this transformation has been carried 
out, we have from (5.3.15) and the corresponding result for E{y,} that 


E{y(A,)} _ Seay det d $(z|u) dz 
EAD} Jaro- det d ó(z|u) dz 


(64.3) 


where z e U(k) c g?MN- ? if and only if d has k negative eigenvalues. Since the 
elements of D and Z are the second-order partial derivatives X7, i,j — 1,..., N — 1, 
we can transform these as in the previous section by setting 


ô 


juj sog 
3 ,)j— L...,N-1 
o? bJ 


vj = dij + 


where A; = E(X7]. Then (6.4.3) becomes 


E{x(Aw} _ fran det(v — uA)9() dv 


(644) EGA} — Jane det(y — uA)6( dv 


where A’ is the diagonal matrix with diagonal elements 4,,..., Ay..,, and 
Y(k, u) is the region over which v — uA’ has k negative eigenvalues. 

If we now expand the determinants in these integrals to obtain polynomials in 
u, as in the preceding section, it is straightforward to see that 


EAD 5 Trao PCY) dy 
(6.4.5) B Er e fan- ond() dv 


Consider the case k = N — 1. Then Y(k, u) is simply the region over which 
v — uA’ is negative definite, and so applying Lemma 6.3.1 to the numerator in 
(6.4.5) immediately establishes the first half of the theorem, ie. (6.3.1). To 
obtain the second half it suffices to note that if k < N — 1 then Y(k, u) is con- 
tained in the complement of Y(N — 1, u). Applying Lemma 6.3.1 then establishes 
(6.4.2) and the proof is complete. 


Let us consider the content of Theorem 6.4.1 more carefully in the special 
case N = 2. In that case, y1(A,) is simply the number of points in S satisfying 


X(t) — u= X,(0 = 0, X,,(t) <0. 
These are points on the level curves of X where the curve is tangential to a line 
parallel to the ¢, axis and convex, when considered locally as a function of t, 
to tj. Similarly, z,CA,)1s the number of points on the level curve where the same 
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tangency conditions hold, but the curve is concave. Thus, Theorem 6.4.1 is 
essentially saying that at most of these tangency points the level curve is convex. 
Indeed, since this theorem could be shown to hold under any rotational trans- 
formation of the parameter space, so that any choice of axes is available, the 
theorem would seem to say that in general the level curves are convex, rather 
than concave, over most of their length. 

To formalize this idea we need to introduce the curvature function x(t) 
defined by 


X2X,, —2X,X5 X45 + XiX5; 


(6.4.6) K(t) = — (X14 xin 


When the curvature at a point t on the level curve of X (at the level u) is positive, 
then in some neighbourhood of t the tangent line to the level curve does not 
contain any points of the excursion set A,. (For a proof of this and the following 
fact see, for example, Spivak, 1970.) 

One of the useful properties of this concept lies in a description it provides of 
the set A,. Suppose the level curve, 0A,, contains a closed curve, and suppose, 
furthermore, that at every point on this curve the curve has positive curvature. 
Then the curve is convex, in the sense that it always lies on one side of its tangent 
lines. Furthermore, if within the region it bounds there are no points at which 
X(t) « uthen this region is convex, in the usual sense. 

What we shall now show is that as u — oo the proportion of the level curve at 
which the curvature is positive tends, in a certain sense, to unity. Since Theorem 
6.4.1 implies that A, tends to contain relatively few ‘holes’ (since E(yo9(A4,)) = 
o(E{x,(A,)})) this tells us that above high levels A, tends to be composed of 
disjoint convex sets. 


Theorem 6.4.2 


Let X(t) bea two-dimensional, homogeneous, zero-mean Gaussian field, suitably 
regular for the level u in S. Let L, denote the arc length of 0A, and write Lj for the 
arc length of these segments of 0A, which have positive curvature. Then 


E{Ly} 
ar 

EXL,} 
To prove this we require the following lemma, which we have actually already 
used in Section 1.7. Recall that any straight line in the plane can be parametrized 


by two coordinates (p, 0), — oo < p < 0,0 < 0 < z,s0 that the equation of the 
line will be 


(6.4.7) 1 as u > oo. 


(6.4.8) t, cos 0 +t, sinf- p=0. 


This linc is of perpendicular distance p from the origin, and the angle between 
this perpendicular and the positive half of the t, axis is 0. If C is a curve in the 
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plane, we write Nc(p, 0) to denote the number of intersections of this curve with 
the line given by (6.4.8). We then have the following lemma. 


Lemma 6.4.1 


If C is a rectifiable curve in the plane and 4,(C) its arc length, then 
(6.4.9) A,(C) = af Í Nc(p, 0) dé dp. 
04—o 


Proof 


Consider firstly the case of C being a straight line of length L. Then N (p, 4) 
will be either zero or one for all (p, 0) except for the one value of (p, 0) for which 
the line with these coordinates actually contains C. We can ignore this case since 
it will not affect the value of the integral (6.4.9). Consider Figure 6.4.1. It is clear 
that if C is a segment of the line with normal coordinates (4, Y) then, for fixed 0, 
N-(p, 0) will equal one within an interval of length L -sin | — 0| and zero 
outside this interval. Thus, 


aff Nep. 0) 40 dp =4 | L -sin ly — 6140 
07—o 0 


ni2 
= il sin 0 d0 


0 
=L. 


This proves the lemma when C is a straight line. Clearly, then, the lemma must 
also be true if C is composed of any number of straight lines. To complete the 


Figure 6.4.1 
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proof it is only necessary to note that a rectifiable curve can always be approxi- 
mated, to arbitrary accuracy, by a polygonal curve, the length of which is given 
by our formula. Taking the limit of these approximations can be justified to 
obtain (6.4.9) for any rectifiable curve. Details of this argument can be found, 
for example, in Blaschke (1936, p. 46) or Sherman (1942). 


Proof of Theorem 6.4.2 


We are now in a position to commence proving Theorem 6.4.2. Firstly, note 
thatneitheranorthogonaltransformation ofthe field coordinates nor a change of 
scale of the t; or of X will affect the truth of the theorem. Hence, since by such a 
transformation and scale change we can make X , and X, uncorrelated, as well as 
setting the variances of X, X,, and X, to unity, we shall assume that these 
conditions hold. Note also that if we restrict X to any line in the plane, then 
under these conditions the variances of both the restricted process and its 
(directional) derivative are also unity. 

The second point to note is that since X is homogeneous there is no loss of 
generality involved if we prove the theorem for the special case when S is a disc 
of radius one centred on the origin. This substantially simplifies later compu- 
tation. 

For a line with normal coordinates (p, 0)let N (p, 0) be the number of crossings 
of the level u by the one-dimensional process obtained by restricting X(t) to the 
intersection of this line and the unit disc. Similarly, let N; (p, 0) be the number 
of these points at which the expression in (6.4.6), i.e. the curvature of the level 
curve, is positive. Then by the precéding lemma 


(64.10) L,=4 Í Í N (p, 0) dp dé, with probability one, 
07-0 


while a similar relationship exists between Lt and N;(p, 0). (Note that the 
conditions ofthe theorem guarantee that the level curves of X are rectifiable with 
probability one.) It is a simple matter to verify that any line with normal param- 
eters (p, 0) has a section of length 2(1 — p?)!? contained in the unit disc if |p| < 1, 
and fails to intersect it at all if p > 1. Thus, taking expectations on both sides of 
(6.4.10) and applying Theorem 4.1.1 we obtain, under the above assumptions on 
X and its derivatives, 


(6.4.11) E{L,} = [au [ siio 0)) dp 


TRE rl 
zl Jt — p^) (=)exo(- 5) p 
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We now turn to computing E(L; ). Note that by (6.4.6) L7 must be greater 
than Lj *, the total length of those segments on the level curve on which the 
matrix X" of second-order partial derivatives is negative definite. Applying now 
our ‘corollary’ to Theorem 5.1.1 (in this case most easily justified by reworking 
the proof of Theorem 4.1.1 to cover the following) we have 


E{L,} 2 EXL,} 2 EUG 5] 
2 
= $n exp( - 22 negative definite| X = u}. 


Applying Lemma 6.3.1 and arguing as in the preceding theorem establishes that 
the probability here tends to one as u — oo. This proves the theorem. 


To round off the current discussion and to prepare ourselves for the following 
sections, let us consider what shape the field itself takes over a high-level excursion 
set A,. Note that X(t) will be concave over any convex set in #? if the matrix 
X"(t) is negative definite over the same set. Thus, if we could show that the pro- 
portion of A, over which X"(t) is negative definite tends to unity as u — oo it 
would follow that, since A, tends to be composed of disjoint convex sets, X(t)isin 
general concave at high levels. We shall prove the following theorem. 


Theorem 6.4.3 


Suppose X satisfies conditions of the preceding theorem. Let M, denote the 
Lebesgue measure of A, and M, the Lebesgue measure of that portion of A, 
over which X'"(t) is negative definite. Then 


E{M,} 
E(M,} — 


1, as u > oo. 


Proof 


Define two new zero-one random fields by 


if X(t) > u, 
t = 
ro i otherwise, 


nqo- 1 if X(t) > 0 and X"(t) negative definite, 
dE otherwise. 


Then clearly 


M, [roa M, [roa 
N VN 
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Thus 
E{M,} = A(S)P(X(0 > uj, 
E{M,} = A(S)P{X(t) > u, X"(t) negative definite}. 
Hence, in obvious notation, 
E{M,} _ frou $(x) dx f ó(x" lx) dx” 
E{M,} fx>u P(x) dx 


where the second integral on the numerator is over the region in which x” is 
negative definite. Applying Lemma 6.3.1 as in the preceding theorems establishes 
this theorem also. 


6.5 ERGODICITY 


In Section 6.7 we shall conduct a careful investigation of the structure of a 
Gaussian field in the neighbourhood of a high-level maximum, during the course 
of which we shall need to compute conditional probabilities in cases where the 
conditioning event is of the form ‘X has a local maximum at the point t = O. 
Since, in general, there will be zero probability of a maximum occurring at 
precisely t = 0, this means we wish to define a conditional probability P{A | B} 
when P{B} = 0. One way of defining such probabilities is to consider a sequence 
of events B, of non-zero probability which converge to the event B. Then one 
may define 

P{A|B} = lim P{A|B,} 
if this limit exists. This limit need not be unique and may depend on the par- 
ticular sequence {B,,}. For a detailed discussion of this problem see either the 
original discussion of Kac and Slepian (1959) or of Cramér and Leadbetter 
(1967, Chap. 11). 

To justify the choice of sequences (B,) that we shall be making to condition 
on the event B = {X has a maximum at t = 0} it is necessary to digress for a 
short time to develop some ergodic theory for random fields, which forms the 
content of this section. In the following section we shall conduct a brief investiga- 
tion of some simple properties of point processes on Æ”. We shall not give 
detailed proofs of the results in either of these sections, since in all cases the 
proofs are straightforward extensions to N dimensions of the proofs of the 
corresponding results on the line. We shall, however, give detailed references to 
these proofs, as well as brief outlines for the reader unfamiliar with this material. 

To set up the machinery of ergodic theory we let X be a strictly homogeneous 
(N, d) field and choose as our probability space the space (G2, &N:?, P). 
Recall that G*’4, as in Section 1.5, is the set of all finite, @*-valued functions on 
BN. GN? is the smallest c field containing sets of the form {g e G^: g(t) e Bj, 
jd... k}, where k is arbitrary, t; € 4%, and the B; are half-open intervals in 
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2^. Finally, P is the probability measure uniquely defined on all sets of gM? by 
the fi-di distributions of the field X. 

With this choice of probability space the elementary events, œw, of the space are 
now individual sample functions, say œ = g(t). We define a family of commuting 
shift transformations TO, j = 1,..., N, on this space by setting 

TOS, vts ty) = beo ET.. ty). 


Similarly, each T® takes any set S € 4": into a set SY composed of the functions 
of S shifted by t at their jth parameter. 

It follows from the strict homogeneity of X that these transformations are 
measure preserving in the sense that P{S} = P{S”} for any j, t, and any S e $7. 
A set S e 4:7 is called an invariant set of X if, for every j, t, the sets S and SY 
differ, at most, by a set of P-measure zero. That is, there exist two sets N, and 
N3, both of which have measure zero, for which 


S U N, = SYMON). 
It is not hard to show that the invariant sets form a ø field contained in 4^? 
and that all sets of probability zero or one belong to this ø field. This leads us to 
the following definition. 
Definition 6.5.1 
A strictly homogeneous (N, d) random field X is said to be ergodic if the o field 


of invariant sets only contains sets of probability zero or one. 


The importance of the notion of ergodicity lies in the following theorem, in 
which, as usual, o(T) denotes the sphere in # of radius T > 0, and By denotes 
the volume of o(1). 


Theorem 6.5.1 (Ergodic theorem) 

Let X be a strictly homogeneous, ergodic, (N , d) field. Then if 
(6.5.1) E(|X'(0]) < oo 
for each component field X'(t), and 


(6.5.2) with probability one, the vector-valued Riemann integral 


Í X(t) dt exists for every 0 < T < oo, 
a(T) 


then 


(6.5.3) X(t) dt ^ E{X(0)} a.s., ds Hc o. 


BsF” Jar 


where the right-hand side is a vector of expectations. 
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The content of this result is simply that averaging a particular sample function 
over Æ” is equivalent to taking expectations, a result of vast practical importance. 
Indeed, in the applied literature (6.5.3) is often taken as the definition of ergodicity. 
For a proof of this and the following results when N = d = 1, see virtually any 
text on probability theory or stochastic processes. For a partial proof of the 
above result for any N and d see Wiener (1939). A full proof is easily obtained as 
a straightforward extension to that used in the case N = d = 1. 

The power of the concept of ergodicity lies not only in the convergence of the 
above integral but the fact that it also implies the convergence of a wide class of 
related integrals. Any random variable Y defined as a measurable function of the 
random variables X(t) for any set of values of t will be called a random variable 
defined on the field X. Then Y will always be a Y"’4-measurable function of the 
elementary event o — g(t), and we can define shift transformations of Y by 
writing 


T9 Y(o) = Y(TPo) = Y(g(t,,..., t; + 5... ty). 
Setting 
(6.5.4) Y(t) = T(V -.. T(PY(o) 


determines a new strictly homogeneous (N, d) field, and, furthermore, the follow- 
ing result is true: 


Theorem 6.5.2 


Let X be strictly homogeneous and ergodic. Then the random field generated 
by the shift transformations of (6.5.4) is also strictly stationary and ergodic. 


To see the importance of this theorem, let us consider an example related to 
excursion characteristics. Fix a level u and let X be a suitably regular, ergodic, 
Gaussian field satisfying all the conditions of Theorem 5.3.1, so that we can 
compute the mean value of the DT excursion characteristic y, as well as being 
able to write down integral expressions for the mean values of its components 
Xr- Set Y, = y, (A,CX, I,)). Then, since each Y, can be expressed as a limit of 
functions defined on the field (cf. Theorem 5.1.1), the field ¥,(t) defined by (6.5.4) 
is, by the above theorem, also ergodic. But 


N 
Y¥,(t) = ase TIt, ti + 1]: X(9 > 4). 
i21 
Hence, it is straightforward to check that, for any T > 0, 


j) Y(t) dt < (AX. o(T)) < Í Y(t) dt. 
a(t SN) a 


(TEN) 
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Applying the ergodic theorem to the extreme terms immediately implies 


XC AX, o(T)) 


BOTY > EUQ(X, I} as, as T> oo, 
N 


from which we immediately have 


X(AKX, o(T)) 


6.5.5 
( ) B, TF 


> E{y(X, I,)} a.s., as T > œ. 

The implications of this result are obvious. For example, we now know that 
the estimators of |A|!? based on excursion characteristics and discussed in 
Section 5.5 are, for an ergodic field, consistent. We shall turn to other applica- 
tions of the above theorems in the following section. 

In general it is not easy to give a simple condition which ensures ergodicity. 
For Gaussian processes, however, this can be done, so let us now assume that 
X(t) is a real-valued Gaussian field with zero mean, and such that its sample 
functions are continuous with probability one over every compact subset of 
A~. For such a field, and for the corresponding covariance function R(t), we 
have the spectral representations (2.4.2) and (2.4.1). An important result, due to 
Maruyama (1949) and Grenander (1950) for N — 1, and easily extendable to 
general N, is as follows. 


Theorem 6.5.3 


If X is a Gaussian field as described above, it will be ergodic if and only if its 
spectral distribution function is everywhere continuous. 


The proof of the ‘if’ part of this result, while technically tedious, is, in principle 
at least, straightforward. It is a result of Fourier analysis that the continuity of 
the spectral distribution function implies that 


lim rf |R(t)|? dt = 0. 
To e(T) 

Thus, on average, R(t) tends to be small if |t|| is large. This means that the 
variables X(s) and X(s — t) will tend to be independent if [t| is large; this 
property is enough to establish ergodicity. In fact, it is also possible to prove the 
following result, a proof of which is implicit in the argument of Cramér and 
Leadbetter (1967, p. 158) for the case N = 1. 


Theorem 6.5.4 
If X is a Gaussian random field, as described above, it will be ergodic if 


(6.5.6) R(t) +0 as ll » x. 
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This essentially completes our discussion of ergodicity. We note, however, 
two points. Firstly, the ergodic theorem holds if the region of integration in 
(6.5.3) is not o(T) but the interval I(T) = (te 27:0 < t; < uT}, for any set of 
l; > 0. In this case the norming constant is not By T" but A(I(T)) = T" [|X uj. 

Secondly, results analogous to those of Theorems 6.5.3 and 6.5.4 also hold for 
9?^-valued Gaussian fields. In this case both the covariance and spectral distribu- 
tion functions become matrix-valued functions. The necessary and sufficient 
condition of Theorem 6.5.3 is replaced by the everywhere continuity (in the 
appropriate space) of the spectral ‘distribution’ function, while (6.5.6) can be 
replaced by 


(6.5.7) E{X(t + Xs) +0 — as [tl > oo 


for every i, j, where X’ denotes the ith component of the vector field X. We shall 
encounter vector-valued ergodic fields in Chapter 8. 


6.6 POINT PROCESSES 


Consider the following set-up. We have a probability space (Q, F, P) and 2 
with its Borel g algebra 4". For each w € Q, S(@) is a set of non-coincidental 
points in Z". If, for each B e A", 


N(B) = N(B, o) = card(B ^ S(o)) 


(where card(A) denotes the number of points in, i.e. cardinality of, A) is a well- 
defined random variable, then S(w) is called a random set and the family 
{N(B): B € 4") a point process on 2". The ‘events’ of this process are, of course, 
the points S(c). We shall be interested, for example, in the case in which the S(w) 
are the local maxima of an N-dimensional random field, and N(B) is simply the 
number of such maxima in the set B. 

There exists a large literature concerned with these processes, most of which 
is summarized in the monograph of Kallenberg (1976). We shall, however, 
content ourselves with only a few of the most basic results of this theory, full 
proofs of which are given in Leadbetter (1972). For proofs in the case N — 1, 
which is notationally simpler, see also Cramér and Leadbetter (1967). 

Let @ = {Cx} be the collector of cubes of the form 


CuK = (te AN: kn < t; < (k; + 1)/n, i = 1,..., N}. 
Then, clearly, Z” = |), C4, for any n and A(C,,) = n^" for any k. We shall say 
that the point process N is regular (or orderly) with respect to € if 
. P{N(Cm) > 1j 
6.6.1 lim sup ———-———— = 
Ce) um VP PONC) > 0) 


(For simplicity, we always assume P{N(C,,) > 0} # Ofor any n, k.) Essentially, 
what this says is that the probability of there being more than one event in a small 
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cube is of lower order than the probability that there is only one event, so events 
tend to be separated in space. 

Finally, for any set B c Ø” let B + t denote B translated by t. Then we shall 
say that N is homogeneous if for any B, € BY, k = 1,...,m, m > 0, the joint 
distribution of N(B, + 0,...,N(B,, + © is independent of t. Then we have 
the following theorem, due originally to Khintchin, Dobrushin, and Korolyook 
for the case N — 1 (reported in various forms in Khintchin (1960) and Volonski 
(1960)), and Belyaev (1969) and Leadbetter (1972) for general N. 


Theorem 6.6.1 


Let N be a homogeneous point process and € = {Cx}. Then the process is 
regular and, furthermore, the following holds, for any te A“, 


(6.6.2) jg Ee Cu +9 za) 


n> 


= E{N(I,)} 


where k is arbitrary and I, is the unit cube in 2^ . This result continues to hold even 
if the right-hand expectation is infinite. 


As a consequence of this result we have, in view of (6.6.1), that if the conditions 
of the theorem hold 
(6.6.3) lim sup P(N(C,) > 1} = on™). 


n>o k 


These are all the results about point processes that we shall require. 


67 THE LOCAL STRUCTURE OF HIGH MAXIMA 


The results of Section 6.4 would seem to indicate that, at least at high levels, 
the sample functions of Gaussian random fields tend to have a comparatively 
simple structure. The existence of such results should mean that if one observes a 
realization of the field at only a single point, t* say, then if X(t*) is large one 
should be able to say something useful about the shape of the sample function 
in the neighbourhood of t*. To be able to do this one generally needs to know 
not only the value of X(t*) but also the values of the first-order derivatives of X 
at t*, as well as possessing some information about the values of its second-order 
derivatives. In this section we shall look, in some detail, at the special case in 
which we observe X at a local maximum of its sample function. This case, 
aside from being algebraically less involved to deal with than the general case, 
is of singular importance in many areas of application of random fields in which 
it is only the behaviour of a field in the neighbourhood of a high maximum 
that needs to be studied. Even for this case, however, we shall not give all the 
details of all the results, for it is unfortunate, but unavoidable, that to move from 
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our previous heuristic studies of high-level excursions to the more rigorous 
study we were about to commence requires us to increase the difficulty of the 
algebra by many orders of magnitude. We shall follow the work of Lindgren 
(1972) for our discussion of maxima. 

Throughout this section we shall assume that we are dealing with an N-di- 
mensional, real-valued, homogeneous Gaussian field, with mean zero and 
satisfying the condition 


(6.7.1) max E(|X;t) — X,(0)) < CItl? 


BJ 


for some finite C > 0 and all t with |t|| small enough. This is a somewhat 
stronger condition than that which we usually impose, and, in fact, leads to the 
following lemma. 


Lemma 6.7.1 
Let X be as described above and suppose (6.7.1) is in force. T hen if 
o*(q) = max supt| X;(s) — Xi(|:s te L, ls — tl < m 


LJ 


we have 
n "P(o*(y) > ge ND 20 — asn +0 
for every & € (0, 1). 


Proof 
We need only consider one pair (i, j) at a time. Fix this and set 
p^(u) = sup(E(IX; (t — X01): ltl] < w/N}. 
Then (6.7.1) implies 
(6.7.2) plu) < C? Nu. 


Writing œ(n) for the modulus of continuity of the particular X;; under considera- 
tion, we have, from Theorem 3.3.3, that œ(y) is bounded, with probability one, by 


" 
16,/N [log B]'/7p(n) + 16,/2N Í (—log u)!? dp(u), 
0 
where B is a random variable satisfying E{B}* < (4/2). Letting K denote a 
(N-dependent) constant which may change from line to line we obtain, on 


applying (6.7.2), and integrating by parts, that 


n 
on) x K[log B + (—log )!'?]'/4n + K Í (—log u)” !? du. 
0 
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(cf. 3.4.8). Making the substitution z = (—log u) and noting that 


ao ao 
Í z Ve "*dz«u' 1? Í e*dzzxu Ve" 
u u 
yields that for n small enough 


1\712 
oln) < K[log B]'?5 + SU ;) n. 


Thus, for any « > 0 and small enough r, we have 
n "P(o(g) >a} € 1^ "P(B > exp([K tan! — (—log n) !?]^ 
< (4/2) Nexp( - NLK tan! — (log n"? p), 


(N+ e)/(N+ 1) 


on applying Markov's inequality. Setting « = y yields 


lim g~ PON) > NF DY 
n>0 


l 120 7 94N 9 1) 
< (4/2) lim nYexp| -4NK- 9 l 
ne 


=0 


This suffices to establish the lemma. 


It is easy to check, from Theorem 3.3.3, that (6.7.1) ensures X and its first- and 
second-order partial derivatives are continuous with probability one, while the 
lemma we have just established ensures that the remaining conditions of Theorem 
6.1.1 are satisfied, as long as the joint distribution of X and these derivatives is 
non-degenerate. Thus, under this latter assumption, the formula given in that 
theorem for the mean number of local maxima above a level u is valid. It is also 
true that under (6.7.1) and the above conditions on X the validity of other 
expectations we shall write down later can be established (cf. 6.7.5). 

In order to study the structure of local maxima we shall need to investigate the 
conditional field 


XQ) = XOA, 


where A denotes the event that X has a local maximum with height u at t = 0. 
The conditioning event is actually the intersection of two events of probability 
zero— X(t) has a local maximum at t = 0 and X(0) = u—and we acknowledge 
this by approximating A by the events 


Ah, ft’) — X has a local maximum with height in (au + h) 
at some point s in the sphere (ti - A 
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Eventually, we shall let h’ and then h decrease to zero to obtain conditional on A 
probabilities as limits of conditional on A(h, h’) probabilities as described in the 
preceding section. In fact, since to specify X, we need only specify its finite 
dimensional distributions, we can obtain these by setting 


P(X,(€) € v, i = 1,...,k} = lim lim P{X(t € v, i = L..., k| A(h, h^) 
hlo wo 
where n > 1 and t € Z", v; € 4!. The only problem, therefore, is to obtain a 
simple expression for the above conditional probabilities. But if we now write 
A(v, h, h’) to denote the event that A(h, h^) occurs and X(t') < vj, i = 1,...,n, 
then we are seeking 


...—— P(A(s, h, AY} 
6.7.3 lim lim ———————7—. 
iu alo wio PIAC, h)} 


Let B(v, h, h^) denote the event A(h, h’) occurs and X(s + €) < v, i = 1,...,n. 
We claim 


0. 


(674) jp UEM E 
a Œ) 


The proof of this result is not difficult, but rather messy. Lindgren (1970) covers 
the one-dimensional case in detail and the N-dimensional case follows in a 
similar fashion using Lemma 6.7.1. We shall not give it here. 

The importance of (6.7.4) lies in the fact that whereas it is easy to determine 
P(B(v, h, h')} and P(A(h, h’)} this is not the case for P(A(v, h, h')). To determine 
the former probabilities we use certain point processes and Theorem 6.6.1. Fora 
compact S c # and T > 0 write 


M(S, u, h) = # {local maxima se S of X at which X(s) € (u, u + AW}, 
M(S, v, u, h) = # (local maxima se S of X at which X(s) € (u, u + h) 
and X(s + €) € v, i = 1,...,k}, 
M,(u, h) = M(E— T, T]", u, h), 
M,(t) = (R(t) i = L..., N; Ri, 5j = LL... Nui x j) 
Then it is immediate from the conditions on X that both M(., u, h) and M(., v, u, h) 


are homogeneous point processes and so, restricting k’ to be a rational number, 
Theorem 6.6.1 and (6.7.4) imply 


fim PLA IIO) 4 PO, h YI" 
wio PLAG HD) wio PLAG, I) IC 


" E{M,(v. u, hy} 
T ELM ya Wy} ^ 
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Thus determining the limit in (6.7.3) is reduced to obtainingan expression for the 
ratio of two expectations and then letting h | 0. 

If the random field is also ergodic, then this ratio is also the limit, with proba- 
bility one, of M (v, u, h)/M q(u, h) as T > oo. Thus, in this case, it has a natural 
frequency interpretation as the distribution of the X values at points s + t 
where the s are the locations of local maxima with a height in (u, u + h). This 
provides the motivation of the use of (6.7.3) as a conditional distribution. 

To obtain a more useful form for our conditional probability we require 
closed forms for the above expectations. For these, we shall as usual write X' 
to denote the N vector of first-order derivatives of X, while X" stands, inter- 
changeably, for either the N x N matrix of second-order derivatives or the 
[NON + 1)/2] vector of these derivatives. On this occasion, however, we write 
this vector in a very special way, by placing the homogeneous derivative X;; 
in the ith position and then filling the remaining N(N — 1)/2 positions with the 
mixed second-order derivatives X;;, i # j. If we now make the simplifying 
assumption that X has unit variance, it follows that the covariance matrix of 
the [1+ N + N(N + 1)/2] vector (X, X', X") has the following natural 
partitioning: 


1 0 Moz 
0 A 0 
Moo 0 M22 


say, where A is the usual matrix of second-order spectral moments, M3, is 
made up of fourth-order spectral moments, and My, = M7, contains second- 
order moments. If we now introduce two new vectors by putting 


M,(O zs ( - R,(0), tt, ~Ry(t)) 
M(t) = (Rt), i = L..., N; R(t), j = 1,..., Ni # j) 


where R, as usual, is the covariance function of X, then the 1+ N + 
N(N + 1)/2 + nvariates X(0), X'(0, X"(0), X(t’), ..., X(t") have a joint normal 
distribution with zero mean and covariance matrix 


1 0 Mo; R(t’) rome, R(t") 
0 A 0 MT(t) zz Mi(t") 
M, 0 M; Mit) M3(t") 


R(t!) M,(t') M(t!) l R(t" — t!) 


REDS MO. MO) ROS t) | 
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Writing t = (t’,..., €") and ,(x, x’, x”, v) for the joint density of these variates 
we have (via the ‘corollary’ of Theorem 5.1.1, justified as mentioned above) that 


E{M,(y, u, h)} c re fes [222 foldet x” |x, 0, x”, v) dx dx" dv 


(6.7.5) EÁM ,(u, h)] {uth (pIdet x"|ó(x, 0, x”) dx dx" 


where D is the region over which x" is negative definite. 
By letting h — 0 and combining this with what has gone before we have the 
following fundamental lemma. 


Lemma 6.7.2 
Under the conditions given above the conditional distribution of 
X(t)), ..., X(t"), 
given that X has a local maximum at 0 with height u, has the density 


zt f» idet x" | $.u, 0, x", v) dx" : 
= fpldet x” |ġ(u, 0, x”) dx" 


(6.7.6) P0, u) 


Although this result completely specifies the distribution of the conditioned 
process X,, (6.7.6) can be simplified into a form which gives considerable 
insight into the structure of X near a maximum. To begin this simplification, 
we write 


det x" ó( — x" |u,0) 


{a n "n if x" € D, 
(6.7.7) V (x^) =} fp det x” 6(—x"|u, 0) dx 
0 otherwise, 
Then (6.7.6) is equivalent to 
(6.7.8) Pel. u) = [ Vx stus 0, =x") dx" 
D 


To obtain a more useful form for y, we introduce the real-valued functions 
A(t), C(s, t), and a vector-valued function b(t) = (bi(t), i = L..., N; bit) 


1 
(6.7.9) (A(t), b(®) = (RO, wo 


1 MaV!/ Rit) 
(6710) C6, 9 R - 9 - (RO). MA) (yw, bano] 


— M,(SA M(t). 
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Then it is a simple matter to check that @,(v|u, 0, — x”) is a normal density with 
means uA(t) — x" - (b(t))" and covariances C(t’, t (cf. 1.6.5 and 1.6.6). The 
density v, comes out similarly: $(x"|u, 0) is normal with mean 


1 0! 
(6.7.11) Mao: Ofo N (5) = ito. 


and covariance matrix 


1 0\"'/M 
(6.7.12) M22 — (Moo, ofo N ( a = M5; — M39 Mo; = M2.0, Say. 


Thus, for x” e D we have 
(6.7.13) W(x") = k; ! det x" exp[—4(x” + uMy.)Mz.4(x" + uM20)}], 


where k, is the obvious normalizing constant. Note that the assumptions made 
about the non-degeneracy of X and its partial derivatives guarantee the non- 
singularity of every interesting matrix. 

It is now convenient to determine the characteristic function of the density 
(6.7.6). Putting s = (s,,..., 5,) we have the characteristic function 


BAS, u) i NC à v)p.v, u) dv 


= [ool f exp(is - v).(v|u, 0, —x^) «| dx". 
D E 


Using the above information about the densities y, and $, we obtain, on writing 
A for the n vector with the jth element A(t’), B for the n x ((N + 1)/2) matrix 
whose jth column contains the vector b(t’, and C for the n x n matrix with 
elements C(t’, €), that 


Bas, u) [vale VLexptusa” — sx"B — js" Cs)] dx’ 


exp(usA" — isCs") Í exp( — sx" By (x^) dx". 
D 


The first factor here is simply the characteristic function of an n variate Gaussian 
variable with means uA(t') and covariances C(t’, t). The second factor is only 
marginally more complicated, being the characteristic function of the n-di- 
mensional random variable x"B, where x" has the density y, and B, of course, is 
non-random. Since characteristic functions determine distributions and fi-di 
distributions determine a process, we have proven the following important 
result. 
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Theorem 6.7.1 


Let X be a Gaussian field as described above. Then, given a local maximum 
with height u at 0, the conditional field X (t) has the same fi-di distributions as the 
field (X, (6€, te Z" defined by 


(6.7.14) X(t) = uA(t) — Z,b'(t) + Y(t), 


where Y is a non-homogeneous, zero-mean, Gaussian field with the covariance 
function C specified by (6.7.10) and Z, is an [N(N + 1)/2]-dimensional random 
variable, independent of Y and with the density w,, determined by (6.7.13). 


This result forms the foundations of a large structure of information that can 
now be obtained about the behaviour of X in the neighbourhood of a local 
maximum. We shall consider only one example in any detail, that of the case 
when u — oo. For this, we rewrite (6.7.13) in a form better adapted for this 
purpose, dropping the tilda from X,: 


(6.7.15) X,(t) = uR(t) — W,b'(t) + Y(t), 


where W, = Z, + uM;,. Note that (6.7.9) implies R(t) = A(t) + M;ob!(t) so 
that the new form is equivalent to the old. 

It is clear from (6.7.15) that as u — oo the term Y(t) exerts little influence of 
X (€), whereas the first term of (6.7.15) is extremely important. The role played 
by the term W, b'(t) is unclear. To clarify this, we prove the following lemma. 


Lemma 6.7.3 


Asu  oothe vector W, of (6.7.13) converges in distribution to an [N(N + 1)/2]- 
dimensional Gaussian random variable, independent of the field Y, having zero 
mean and the covariance matrix M,., defined by (6.7.12). 


Proof 


The vector W, has the density y*(w) = w,(w — uM;g). Consider now wand 
W, as symmetric matrices, formed from the vectors w and W, in the same way 
the matrix x" is formed from the vector x". Then the vector w — uM, is equiva- 
lent, in this sense, to the matrix w — uA, so that, by (6.7.13), 

(6.7.16) u "k,*(w) = u^ "det(w + uA)exp( -àwM;. bw?) 


for w + uA negative definite. Clearly, as u — oo, the right-hand side tends 
pointwise and with dominated convergence to 


det A exp( - j: WM, 1w"), 
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and so it follows that there exists a finite k,, for which 
u "k, > det A fexpc—dwmtz.bw") dw = det A-k,,. 


From this it follows that 
Vw) > kz! exp( -jwM zw) 


with dominated convergence, which is the content of the lemma. 


The content of this lemma is, of course, that the random term W, in (6.7.15) 
is of moderate order for all values ofu. Hence, it follows that in the neighbourhood 
of a high maximum at a point t*, say, the random field looks, in probability, like 
uR(t — t*), with random oscillations of O(1). This fact can be used to make a 
variety of precise statements about the behaviour of X,,(t) once further restric- 
tions are placed on the covariance function. 

In the paper on which the above discussion is based (Lindgren, 1972), a 
variety of such statements are made. However, as virtually all of them involve 
substantial algebra to be proven, we shall content ourselves with merely 
describing them, referring the reader to the original paper for details of their 
formal statements and proofs. 

A basic result is the following. If I c # is compact, 


I,= Ia {te 2: |e > e} 
and if, for all e > 0, 


inf max |R(€| > 0 


tel; 1xixN 


then, as u > oo, 
P{X,, has a critical point for some t € I, t # 0} > 0. 


That is, X, only has critical points when R(t) does. 

This result provides a building block for many others. For example, suppose 
that R(t) has a local minimum at some point t* and that in the neighbourhood 
of t* the matrix (0^R(t)/0t; 0t;) is positive definite. Suppose, furthermore, that 
within this neighbourhood R(t) looks like a quadratic form in the (t; — tf). 
Then X,(t) will have local minimum at some point t" near t*, and c" — t* is, 
after appropriate normalization, asymptotically an N-dimensional Gaussian 
variate as u > oo. 

If the field X is isotropic, then since R(t) depends only on ||t|| it can have no 
strict local minima. This, of course, does not rule out the possibility that X, has 
strict local minima, but the locations of these are harder to determine than in the 
non-isotropic case. Nevertheless, some results can still be obtained. For 
example, consider V(t) as t runs along some ray out of the origin, and let t, 
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be the value of ||t|| at the first local minimum of X, in that direction. Then if R 
has (non-strict) local minima for |t|| = t*, the asymptotic distribution of t, — t* 
can be obtained as u — oo. 

As well as studying the structure of a random field in the vicinity of a high 
maximum, Lindgren has also applied Theorem 6.7.1 to the study of low maxima 
as u — oo. For example, he shows that under further restrictions on the 
covariance function X,(t) tends, as u — oo, to a certain fourth-degree poly- 
nomial in t,,..., ty with random coefficients. We refer the reader to Lindgren's 
paper for precise statements of this and the preceding results. 

Finally, we note that in a number of papers, including those of Lindgren 
(1975, 1979) and de Maré (1977), theorems analogous to Theorem 6.7.1 have 
been used in the one-dimensional case to develop various results about the 
prediction of extreme values of random processes. No such study has been 
undertaken for random fields, although this would seem to be an interesting and 
important area. 


68 THE LOCAL STRUCTURE OF EXCURSIONS ABOVE 
HIGH LEVELS 


At the beginning of the previous section we expressed the hope that, since 
excursions above high levels seem to take on a reasonably simple form, it should 
be possible to say something worthwhile about an excursion, given the value of 
the field and its derivatives at one point within the excursion set. The remainder 
of the section was then devoted to the special situation in which it was observed 
that the field had a local maximum att = 0. A more general case has been studied 
in Nosko (1969a, 1969b, 1970), although he obtains, not surprisingly, less 
detailed results. Although Nosko's results have also been reported in Belyaev 
(19722), neither in this nor the original paper are any detailed proofs given. 
Nevertheless, we shall give the results here because of their extreme practical 
value. The method of proof used to obtain these results seems, from the comments 
of both Nosko and Belyaev, to be along the lines of that used by Lindgren, via 
conditional, ergodic distributions, but probably without as formal a *model 
process' as in Theorem 6.7.1. It would seem to be a useful project to develop such 
a model process in detail, particularly with a view to extending the prediction- 
type results mentioned in the preceding section. We shall not attempt to carry 
out this task here, for it would require substantially more involved algebra than 
that required in the last section, without producing, as far as our current interests 
are concerned, substantial new insights. 

Throughout this section we shall assume that the real-valued, N-dimensional 
Gaussian field X(t) satisfies all the conditions set out in the previous section so 
that Theorem 6.7.1 holds. Furthermore, we shall assume that X is ergodic. We 
again assume, for notational convenience, that X has unit variance. Then what 
Nosko has shown is essentially the following. 
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Theorem 6.8.1 


Given that the random field X(t) takes the value u at t = t and X((*) = d, 
i= L..., N, then, with probability approaching one as u > œ, the field has the 
following representation over that component of the excursion set containing t: 


1 
(6.8.1) X(t) = u + d(t; — 9" + ult — A(t — 1)" + o|), 
u 
where d= (d,,..., dy) and A is the usual matrix of second-order spectral moments. 


Consider the special case when d = 0. Then, if u is high, we know from Lemma 
6.3.1 that the critical point t will, with high probability, be a local maximum. 
Hence, Theorem 6.7.1 implies that in place of (6.8.1), with d — 0, we should have 


(6.8.2) X(t) = uR(t — t) — W,b!(t — t) + Y(t — 1), 


as in (6.7.15). However, if we expand R(t) as a Taylor series, the first terms of the 
expansion are simply 


1+ Xt — 9A(t — 9), 


so that there is, in view of Lemma 6.7.3, no disagreement between these results. 

Certainly, however, Theorem 6.7.1 gives the stronger result in this case. 
Consider now the special case N = 2, in which it is possible to see even more 

clearly the value of the above theorem. For notational convenience, let us 

suppose t = 0. Then the theorem says that the excursion of the field that covers 

t — 0 looks much like the following expression, with high probability (if u is 

high): 

(6.8.3) u + dt, + dot; + SulttAy,  2t05415 + 02452). 

This is simply a section of an elliptic paraboloid, lying on the plane z = u in 

the space 2? of points (t4, t;, z). It is a simple matter to check that this paraboloid 

has its peak at the point t*, given by 

412d; — À33di * À12d1 — Adz 


(6.8.4) pum = —*. 
: u(A, 1422 — Ài2) ? u(A411455 — Ai) 


At this point, the value of the paraboloid is 


1 À4,d — 24,5did; + A5; di 
2u À11422 — Aia 


as is easily verified by substituting (6.8.4) into (6.8.3). 

These results are, naturally, useful in practice, for they permit prediction of the 
height and location of a maximum from a single observation of the field and its 
derivatives somewhere in the excursion set. 

Another interesting. problem is the following. Suppose we observe that 
X(0) > u. Phen we know that 0 belongs to a component of the excursion set of X 


(6.8.5) u — 
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and somewhere over this component X must have a local maximum. What can 
be said about the (conditional) distribution of this maximum, if we require no 
information about its position? Clearly this distribution is equivalent to the 
conditional distribution of the height of a local maximum at 0, given that such 
a maximum occurred and exceeded u. This is given by the following result. 


Theorem 6.8.2 


Given that the field X(t) satisfying the conditions above has a local maximum 
at t = O with a height exceeding u, the conditional distribution of m, = X(0) — u 
(i.e. the excess height above the level u) is given by 
(6.8.6) lim P{um, > v|m, > 0} = exp(—v). 


uc 


Proof 


Let M(x) denote the number of local maxima of X in the cube [— T, T]" 
which lie above the level x. Then My is clearly derived from an ergodic point 
process, so that as in the preceding section we can apply the results of Sections 
6.5 and 6.6 to conclude that 


E(M (u + v/u)} 
E{M,(u)} 


Since we need only consider these expectations for large values of u, we can 
apply the asymptotic formula of Theorem 6.3.1 to them to obtain 


pak! 2 
lim EMU + vu) im EPLE + Pho /2] 
u co E{M,(u)} u> oo exp(—u /2) 
= exp( — v), 
which, together with (6.8.7), proves the theorem. 


(6.8.7) P(um, > v|m, > 0} = 


Nosko has results which go beyond (6.8.6) by applying Theorem 6.8.1. To 
state these, set y = |A|U?/2x and let S, denote the area of the ‘base’ of that part 
of the surface (6.8.3) lying above the plane z = u, while V, denotes the volume 
of the body bounded below by this base and above by the surface. Then con- 
ditional on S, > 0 and, respectively, V, > 0, Nosko's results state that both 
ju^S, and (2yu3V,)'/? have the same limiting conditional distribution as um,, viz. 
(68.8) lim P{yu?S, > v|S, > 0) = lim P{(2yu3V,)'/? > v|V, > 0} 


us co u> œo 


= lim P{um, > v|m, > 0} 


= exp(—v). 


Again, no detailed proofs are given, despite the fact that to go from Theorem 
6.8.] to (6.8.8) a functional himit theorem is needed. 
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69 ON THE GLOBAL MAXIMUM 


We now turn from our study of the behaviour of random fields in the neigh- 
bourhood of high-level local maxima and excursions to consider a quite different, 
although equally important, problem: the distribution of the global maximum. 
In particular, if we denote the maximum of a field X(t) as t varies over some 
region S c Z by 


(6.9.1) M(S) = sup(X(t):te S} 


then our aim will be to examine the distribution of the random variable M(S). 
Unfortunately, this is not a simple problem. For example, if X is simply a 
stationary Gaussian process on the real line, then the exact form of the distribu- 
tion of M([0, T]) is known in only five special cases; i.e. for only five covariance 
functions. Three of these are reviewed in Slepian and Shepp (1976), where a 
fourth is also presented, and the fifth is given in Cressie and Davis (1981). When 
we turn to random fields it is therefore perhaps not particularly surprising to 
find that there is not a single Gaussian field for which the exact distribution of 
the maximum is known. 

To obtain any information at all about these distributions we are forced to 
turn to asymptotics. Writing the exceedence probability as 


(6.9.2) F(S, u) = P{M(S) = u}, 


it turns out that we can find either the limiting form for F(S, u) as the level u — oo 
and the set S remains fixed or as S and u tend to infinity together, at a certain 
prescribed rate. The literature related to this problem in the one-dimensional 
situation is vast and, fortunately, has been recently given a full and unified 
presentation in Leadbetter, Lindgren, and Rootzén (1979b). Because of the 
existence of this work, and since the mathematics of the random field case differs 
only in minor (albeit laborious) detail from the one-dimensional situation, 
we shall not give the proofs of the results we are about to encounter. 

We shall consider two distinct situations. So far we have always assumed that 
the random fields we deal with have smooth sample functions, i.e. they are 
continuously differentiable, etc. This will be the first situation. However, it is 
not always true that such smoothness conditions hold, in which case, as we shall 
show in detail in Chapter 8, the type of excursion set analysis we have been 
doing so far becomes meaningless. Somewhat surprisingly, perhaps, the 
properties of the global maximum are not substantially altered, so we treat this 
second case here as well. 

Thus, let us commence by assuming X(t), t € A" is a homogeneous Gaussian 
field with mean zero and a covariance function that can be written in the form 


(6.9.3) ; RG) 1 (XU + ode), 
where 3 is a positive definite, symmetrie matrix, Then itis easy to see that X(t) 
is differentiable in mean square and that Lois related to the usual matr of 
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second-order spectral moments by o,, = 14;;. Suppose, furthermore, that there 
exist finite C,, C; > 0, such that for e small enough 


1 — RÐ 


6.9.4 Ci < 
( ) 1 TTE 


<C, whenever ||t|| < «. 
Then, setting 


(6.9.5) Yu) = Q/2nu) *exp(—4u’), 
Belyaev and Piterbarg (1972a, 1972b) have proven the following theorem. 


Theorem 6.9.1 


Let X be as above and suppose that (6.9.3) and (6.9.4) are satisfied. Then, if S 
is any interval in A“, 


. F(S,u) 
(696 im i 


where À is Lebesgue measure. 


= Qn) "^ | AJAS), 


(This is not exactly the form of the result presented by Belyaev and Piterbarg, 
in which there exists some confusion insofar as powers of 27 and |A| are con- 
cerned.) 

If we now suppose that the field X of the theorem has variance c? > 0, not 
necessarily equal to one, it is a straightforward consequence of (6.9.6) that 


li F(S, u) 
ww V (ufo )ujoy* 


Write K, to denote the right-hand side of this equation. Then taking logarithms 
and using the form of VY given by (6.9.5) we have 


N—1 2 2 
lim E F(S, u) — (2) any?) + | = log Ky 


= Qu "?|A['? AG). 


which implies 


(6.9.7) lim u^? log F(S, u) = —(207)71. 

This result, of course, contains much less information than does (6.9.6), since 
many of the informative terms of (6.9.6) have been lost. However, it can be shown 
that (6.9.7) holds under much weaker conditions than those imposed in Theorem 
6.9.1. In fact, (6.9.7) holds for any almost surely bounded, homogeneous Gaussian 
field, and extends even further, as the following result shows. (This was proven 
by Marcus and Shepp, 1972, extending, among others, the results of Fernique, 
1970, and Landau and Shepp, 1970.) 
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Theorem 6.9.2 
Let X be a Gaussian random field and S a region of R` for which 
P{M(S) « o} = 1. 


If 
of = sup E{|X() — EX(t)|?} 
te 
then 
(6.9.8) lim u^? log F(S, u) = — (20$) '. 


u—> æ 


Let us now return to the result of Theorem 6.9.1. With M,(S) and y(A,(X, S)) 
defined as usual we have that if the conditions of Theorem 6.3.1 are in force then 
(6.3.1) and (6.3.7) imply 


.. E{M,(S)} EET E(XA(X, S) _ —N/2) A 1/2 


assuming, for simplicity, that c? = 1. But the right-hand side here is 
identical with the right-hand side of (6.9.6). The reason underlying the phe- 
nomenon is simple to describe, although somewhat involved to derive rigorously. 
Essentially, if X(t) exceeds a high value in S, then it will tend to have only one 
high excursion set. This, in accordance with the last two sections, will tend to be 
associated with one local maximum. Hence 


P{sup X(t) > uj ~ P(X(t) has one local maximum in S above the level u} 
s 


^ P{the excursion set A, has one convex component}. 


(See, for example, Hasofer, 1978, to see how these equivalences can be easily 
exploited using heuristic arguments.) 

This type of asymptotic correspondence has been used in Piterbarg (1972) 
to study the asymptotic distribution of the number of connected components of 
excursion sets, as both the level u and the size of the set S becomes large. Thus, 
with yy- ,(A,(X, S)) defined as usual (cf. 4.4.6), the following result is a corollary 
of a result of Piterbarg. Its original version, in one dimension, is due to Volkonski 
and Rozanov (1961) and to Cramér (1966). 


Theorem 6.9.3 
Suppose (6.9.3), (6.9.4), and the conditions of Theorem 5.2.2 are in force. Let 


JA] u" 'expC hie) 


(2ny 60x 


(6.9.10) ulu) 
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and suppose 
(6.9.11) lim R(t)log(t) = 0. 


|| tll oo 


Then writing S(a) = [0, a]" we have 
The 
(6.9.12) lim Pixy LAX, SCT/u0))] = k = kl 


u—> oo 


for every integer k and real T > 0. 


The content of this theorem is relatively straightforward: the number of 
points t which contribute to make up yy. , has, properly normalized, a limiting 
Poisson distribution. Since this number is, at high levels, essentially the same 
as the number of components of the excursion set A, and the number of local 
maxima above u, these should have the same limiting distributions under the 
same normalization. This, however, has never been rigorously proven. 

Before we leave this result it is worthwhile to note that (6.9.12) continues to 
hold, in an obvious fashion, if S(T/u(u)) is replaced by an interval in 4" other 
than a cube. Secondly, in one dimension, it is known that (6.9.11) can be re- 
placed by a condition of the form 


(6.9.13) Í |R(t)|? dt < oo, 
RN 


and there is no reason to suppose this cannot also be done here. 

Using Theorem 6.9.2, Piterbarg (1972) has obtained an asymptotic expression 
for F(S, u) as both S and u tend to infinity. Again, his result is slightly incorrect 
as regards some of the normalizing constants, and is corrected below. A slightly 
different and independent proof of essentially the same result is given in Bickel 
and Rosenblatt (1973). To state this result, let 


(6.9.14) A(T) = QN log T) 2, 
(69.15) B(T) = (2N log T)! 
{N — 1)(log log T) + logL|A| A(N/2)^ - ? Qn)! 
Ü QN log T)? 


Then we have the following result which essentially dates back to Cramér (1965) 
for N =1. 


Theorem 6.9.4 


Let X(t), t € Z“ be a zero-mean, homogeneous Gaussian field with covariance 
function satisfying (6.9.3), (6.9.4), and either (6.9.1 1) or (6.9.13). Then 


(6.9.16) lim p B M ut exp] exp u)l. 


I 


AT) 
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In the one-dimensional case the conditions (6.9.11) and (69.13) relating to the 
rate of decay of R(t) as ||t|| ^ oo have recently been slightly weakened, while 
retaining the one-dimensional version of (6.9.16). (See Leadbetter, Lindgren, 
and Rootzén, 1979a, and Mittal, 1979, for details.) Again, despite the lack of a 
proof, there seems to be no reason why similar results should not hold for fields 
as well. 

The preceding three theorems summarize what is known about the distribu- 
tion of the maximum of smooth random fields. For applications of these results 
to reliability-type problems we refer the interested reader to Belyaev (1970, 
19722), Belyaev, Nosko, and Svesnikov (1975), and Bolotin (1969, 1973). We now 
turn to fields which do not satisfy (69.3), the condition which ensured the 
existence of smooth sample functions, and see what sort of results pertain. 

There are two distinct formulations of the more general problem, one due to 
Belyaev and Piterbarg (1972a, 1972b), the other to Bickel and Rosenblatt 
(1973). We shall give a version of the latter only. 

To weaken (6.9.3) suppose the covariance function R(t) can be expressed as 
follows, as |[t|| ^ 0: 


(69.17) R(t) = 1 — itl if | (si) 0 | dF(9) + o(lit]9. 


whereO < « < 2and F is a distribution on the unit sphere S" in 4" such that the 
integral form on the right-hand side of (6.9.17) is non-singular. 

When « — 2 this representation simplifies considerably, for if we define the 
parameters o;;, i,j = 1, N, by 


sx 
it is not hard to see that (6.9.13) simplifies to 


(6.9.18) R(t) = 1 — tEt + o(|t|), 


so that we are back in the previous situation. 


Jj? 


When « « 2 it follows from Section 2.2 that X cannot be mean square dif- 
ferentiable and, as we shall see in detail in Chapter 8, possesses quite erratic 
sample functions. 

Now let Y(t) be a Gaussian field with 


E(Y(0) = — ltl? f. (ia) 6 


and covariance function 
s 
( | l j 
Iis i 


Ry(s, t) = isi’ | 
ls uf » (jo 
iS 


eS N 


" dF( 


"u 


" dF(0) 


dF(0) + iu 


SN 


d (0). 
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We use this field to define the strictly positive constant 
(6.9.19) H, = lim r=» f Plur Y(t) < sle ds. 

T> o 0 S(T) 


Although this constant figures prominently in the forthcoming result, nothing 
is known about it (even when N = 1) except for two special cases, as in the 
following lemma. 


Lemma 6.9.1 
(a) If x = 2, so that (6.9.18) holds, then 

H, = mP E = Qn) "? At. 
(b) Ifa = land 


N 
RŒ) = 1 — Clty] +--+ nb) + of D 
i-1 


then 
A, = CN 


For all other values of «, we must accept H, simply as an unknown constant. 
It is used to define the norming sequences 
AAT) = A(T) = (2N log T) !? 
BAT) = QN log T)!? 


2 


1 2N 
+ E (> — 1 )dog log T) + log[(2r) 7H, QNYICN/— v 
QN log T)!? l 
Clearly B;(T) is identical with the B(T) of (69.15). We can now state the 
following theorem. 
Theorem 6.9.5 


Let X(t), te A“ be a zero-mean, homogeneous Gaussian field with covariance 
function satisfying (6.9.17) and 


Í | R(t)? dt < oo. 
RN 


Then 


d pL T D < «| = exp[ - exp( —u)]. 
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For an extension of this result to certain non-homogeneous fields see Bickel 
and Rosenblatt (1973), where the asymptotic distribution of normalized minima 
of fields is also discussed. 

There seem to be only two non-asymptotic results available. The first 
relates to a field known as the Brownian sheet. This field, which we denote by 
W(t) and shall discuss in detail in Chapter 8, is the zero-mean Gaussian field 
defined on [0, oo)" with covariance function 


N 
E(W(sW(0)) = IT min(s;, t;). 
i-i 


Goodman (1976) has proven the following theorem. 


Theorem 6.9.6 


Let W be the Brownian sheet and I, the unit cube in 2". Then, if P(x) represents 
the standard normal distribution function, 


(6.9.20) 4 [Froo dx < phsup W(t) = u} < 4d(— u) 


Ty 


for all u > 0. 


(In a paper on the same topic, Zincenko, 1977, presented an incorrect formula 
for the exact distribution of the maximum of W(t).) 

The second field for which some non-asymptotic results are available is a 
multiparameter version of the ‘triangular covariance’ process studied originally 
in Slepian (1961). To define this, let t € Æ“ and let X(t) be a zero-mean, homo- 
geneous Gaussian field with covariance function 


lul 


` Itl 
(69.21) R(t) = [T max(1 — td ) 
i-i 


Then we call X a t-triangular covariance field. For these fields Cabaña and 
Wschebor (1981) have proven the following theorem. 


Theorem 6.9.7 


Let X be a t-triangular covariance field on A~. Then, if te BX and I(t) = 


LIS. (0, ti], 


. . N N t; 1:2 
(6.9.22) Pésup X(t) = up <4 ul] . 
Ht i 1 f, t fj 
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Both of these results have applicability beyond their relevance to the par- 
ticular fields referred to in the statements of the theorems. This is because of the 
so-called 'Slepian inequality (Slepian, 1962), which can be formulated as 
follows. 


Lemma 6.9.2 
Let X(t) and X(t) be two zero-mean Gaussian fields such that, for all s and t, 
E(X4()X,(0) > E(X;()X(0j 
and 
E{X7()} = E(X2(0). 


Then, for any u, — oo <u « oo, and compact S, 
(6.9.23) P| sup X,(t) = u} < Plsup X,(t) = 2 
s s 


Thus, if we know the distribution of the supremum of a particular field, we can 
use (6.9.23) to obtain similar information for fields whose covariance functions 
are bounded by that of the particular one. An example of this procedure is given 
in Cabana and Wschebor (1981) for fields related to the triangular covariance 
fields. 

Finally, we conclude by noting three problems related to maxima that we 
have not discussed. Pickands (1969a, 1969b) introduced the notion of & up- 
crossings for processes on 4?! as a way of generalizing the usual notion of up- 
crossings to processes with non-differentiable sample functions, and showed that 
this concept was related to the distribution of M(S) in this case. A similar concept 
has also been introduced for random fields, and is discussed in Belyaev and 
Piterbarg (1972a, 1972b) and Piterbarg (1972). 

Secondly, results like Theorems 6.9.4 and 6.9.5 are clearly related to how fast, 
or slowly, X(t) grows as ||| > œ. Such growth conditions are generally ex- 
pressed by seeking upper and lower functions, dy and ġ, say, for which there 
exist finite constants Cy and C, satisfying 
IXOl L im Xl 


m 

nee ou) ^ "tyro PLO 
with probability one, where both y(t) and ¢,(¢) tend to infinity as ||t|| > oo. 
Problems of this type for fields are covered in Qualls and Watanabe (1973), 
Kóno (1975), and Judickaja (1975). Marcus and Shepp (1972) provide a 
substantial review of the one-dimensional situation, with more recent results 
given by Mittal and Ylvisaker (1976). Geman (1980) also contains some interest- 
ing results of a slightly dilferent nature. 


(6.9.24) 2C 
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It is also possible to obtain results about the distribution of the amount of 
space a Gaussian field occupies above a high level; i.e. an asymptotic form for 


P| Í Iu, oo X(t) dt > 7 


where Tiu, œ is the indicator function of [u, oo). Some partial results of this type 
are presented in Adler (1978a). 


CHAPTER 7 


Some Non-Gaussian Fields 


So far, the methodology developed in the preceding three chapters to study 
the excursions of random fields has been applied only in the Gaussian case. 
The reason for this is rather simple: the relatively uncomplicated form of the 
multivariate Gaussian density (and hence finite-dimensional distributions of 
Gaussian fields) makes it a reasonably straightforward task to carry out 
detailed computations of quantities such as the mean value of excursion 
characteristics in this case. Unfortunately, however, nature does not always 
generate only Gaussian processes, so that in order to model nature it is often 
useful, and occasionally essential, to have other than Gaussian fields in the 
modeller's box of tricks. 

In this chapter we shall look at two non-Gaussian fields and see what can be 
said about their excursion-related behaviour using the methodology that has 
already proved so useful in the Gaussian case. We shall start with an investiga- 
tion of what we shall call y? random fields. Since the y? distribution depends on a 
parameter, n say, denoting its degrees of freedom, this presents us with a wide 
class of positive-valued fields which hopefully could be used to model a wide 
variety of phenomena. Following this investigation we shall consider the 
‘envelope’ of a Gaussian field, a concept we shall develop after a slight diversion 
into some necessary spectral theory. 

Both the y? and envelope fields are, in a certain sense, generated by Gaussian 
fields without they themselves being Gaussian. The Gaussian generation, 
however, is important in terms of providing the necessary machinery (i.e. fi-di 
distributions) for handling involved computations. To keep the algebra to 
manageable proportions most of the fields considered in this chapter will be two 
dimensional only. 


71 THE x? FIELD 


To build a x? field we start with n independent, homogeneous, real-valued, 
Gaussian fields X'(t),.... 7 X"(t), te AY. We assume that each X! has zero 


los 
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mean and that they each have the same covariance function, R(t), with variance 
a? = R(0). From these we define a process Y (t) by setting 

(7.1.1) Y(t) = [X (OP t --- + [X 0r 

for each t e Z”. It is straightforward to check that the univariate density for 
Y(t) is that of a scaled y? random variable with n degrees of freedom, so that it 
is of the form 


(7.1.2) JO = [err day ye exp( 15) y>0, 


where I (x) is defined recursively by 


(7.1.3) T(x)-(x-DI(x-1, TOW=L TO=. 


We shall call any random field that is either formed in this fashion or can be 
decomposed into a sum of the form (7.1.1) a x? field with parameter n. 

The y? processes on the line have been used for some time as models for 
wind loads on engineering structures (see, for example, Hasofer, 1972) and have 
been studied in some detail by Hasofer (1974), Sharpe (1978), and Lindgren 
(19782, 1978b). They have not previously been studied on 2", except by Sharpe, 
on some of whose unpublished notes a portion of the material of this section is 
based (especially Theorem 7.1.2). 

Let us consider the covariance structure of a y? field. It is a simple consequence 
of (7.1.2) that if Y is a y? field with parameter n then, for any t e A“, 


E(Y(0) = no?, | E(|Y(t) — no?|?) = 2no*. 
Writing R* for its covariance function we have 


(7.1.4) R*(s, € = E([Y(s) — no?][Y(t) — nc?]) 
= rls (X(s)? — d b (xp) — na? |} 
= sux exer + LAX OM ELMO?) 


— 2ng? y ELX OJ} + n?o* 


= n[c* + 2R?(s, 0] + n(n — 1)a* — 2n- no^ + n?c* 

= 2nR*(s, t) 
where R is the common covariance function of the X*. Since the X' are strictly 
homogeneous, it follows that so is Y. Furthermore, the finite-dimensional 
distributions of Y, and thus all its statistical properties, are completely deter- 
mined once in and R* (or, equivalently. R) are known. 
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In order to apply the methodology of excursion characteristics to y? fields 
the first problem that requires attention is that of the suitable regularity of their 
sample functions. Since y? fields are always non-negative, it is immediate that 
the problem of suitable regularity at negative levels is vacuous. To consider 
suitable regularity at non-negative levels we must, according to Definition 3.1.1, 
investigate the various partial derivatives of Y. Denoting partial differentiation 
as is our custom, by subscripting, we have from (7.1.1) that 


(7.1.5) Y(t -2Y XOX,  forjo L..., N, 
i=1 


(7.1.6) Y= 2 ¥ XOXO +2¥ X'(0X5(0,  forj,k = 1,...,N. 
i-1 t=1 


From these relationships it is clear that the Y; and Y, will be continuous if the 
Xj and Xi, are, which is easily determined from R, or R*, as discussed in 
Sections 3.3 and 3.4. Now consider a level u > 0. Then, using the theorems of 
Section 3.2 and the fact that the Y; and Y; are, for fixed t, simple functions of 
Gaussian variates, it is straight-forward to check that, asis the case with Gaussian 
fields, x? fields always satisfy conditions (3.1.2) to (3.1.4) of suitable regularly 
as long as the joint distributions of (X, Xi, ie j, k= 1,...,.N) are non- 
degenerate for each 1 < i < n. Thus a y? field will be suitably regular at a level 
u > 0 with respect to some subset S of @ if and only if each of its component 
fields X' is. 

The x? fields are not suitably regular at the level u = 0 for any S. This is seen 
simply from (7.1.5) and (7.1.6). If Y(t) = 0 then X‘(t) = 0 for each i, so that 
Y(t) = 0 for every j, thus violating condition (3.1.2) of suitable regularity. 
Thus, for the moment, we shall only consider excursions above strictly positive 
levels. 

With the problem of suitable regularity solved, we can now state the following 
theorem. 


Theorem 7.1.1 


Let Y be a y? random field on 2t" with parameter n. Let u > 0 and S c 2" be 
compact and convex. We assume that the component fields X',..., X" of Y have 
almost surely continuous partial derivatives of up to second order with finite 
variances in an open neighbourhood of S and that the joint distribution of each X! 
and these partial derivatives is non-degenerate. Furthermore, let v(h), v;(h) denote 
the moduli of continuit y of Y;, Y;; and suppose that, for any € > 0, 


(7.1.7) P [mass v(h)] > j| = ah’) as h | 0. 
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Then the mean value of the DT characteristic of the excursion set A, = A,(Y, S) 
is given by 


(7.1.8) EQ(AQ) = ASX- DYT? Í NEC d)ó(u, 0, . .. , yw, Z) dyw dz, 


where D is the usual (N — 1) x (N — 1) matrix of second-order derivatives 
Y,;, $ is the obvious density, and the second integral is over all z € RYN 2, 


Proof 


This result is really little more than a restatement of Theorem 5.2.1 written 
in terms of a x? field. The discussion prior to the statement of the theorem 
suffices to establish that the conditions placed on the component fields X' 
ensure that Y satisfies the non-degeneracy requirements of Theorem 5.2.1. 
This is all that the proof requires. 


Although the statement of the above theorem closely parallels the correspond- 
ing one for the Gaussian case, it is not, as it stands, quite as easily applied. This is 
because we have not yet determined simple conditions under which the crucial 
continuity condition, (7.1.7), will be satisfied. We do this now in the following 
lemma. 


Lemma 7.1.1 


Let Y be a x? field with component Gaussian fields X',..., X". Write w’,(h) 
for the modulus of continuity of X. Then (7.1.7) of Theorem 7.1.1 can be re- 
placed by 


(7.1.9) Pls œi (h) > e} =o(h) ash | Ofor each e > 0. 


ijk 


The advantage of replacing conditions on the moduli of continuity of Y 
and its derivatives by similar conditions on those of the component fields lies 
in the fact that since the latter are Gaussian we can appeal to the results of 
Chapter 3 (e.g. Theorem 3.4.1) to obtain simple conditions on their covariance 
functions sufficient to ensure (7.1.9) is satisfied. 


Proof 


Set S be compact and convex, and set 


(7.1.10) ` M sup (max(| X6]. | X0]. XOD. 


tes 
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Then note that, by (7.1.5), 


Y(t + s) — Y(s) = D» [x(t + s)X(t + s) — X(s)X'(s)] 


= 2Y (Xt +s) — X(1Xt + s) 
-[X*(t +s)- X i(9)]X (s)}. 


From this we have, writing o and wi for the moduli of continuity of the X' and 
Xi, that 
J > 


vh) < 2M Y; Lo (h) + oh]. 
i=1 


But this is almost all we need to complete the proof, for, since the conditions of 
the lemma and the comments following Theorem 5.2.2 ensure that the œ tend 
to zero at the right probabilistic rate, so must the v, and a brief glance at the 
proof of Lemma 5.2.2 (which contains the bulk of the proof of Theorem 7.1.1) 
shows that this is all we need. A difficulty arises, however, as a consequence 
of the fact that the constant M of (7.1.10) appearing in the above inequalities 
is random. However, the proof of Lemma 5.2.2 can be altered slightly by 
conditioning on the event M < K* throughout the proof, and eventually 
letting K* — oo, to obtain a proof that works under the present conditions. We 
leave the details to the interested reader. 


We can now turn to developing the first tool needed for studying x? fields via 
their excursion sets: viz. a simple form for the mean value of their excursion 
characteristics. To do this, we must evaluate the integral of (7.1.8). This integral, 
which was difficult to evaluate even in the Gaussian case, is yet more complex in 
the current situation. The only evaluation of it that has been successfully carried 
out is in the case when the field is defined on the plane, for which we have the 
following result. 


Theorem 7.1.2 


Let Y(t), t e 2?, be a two-dimensional y? field with parameter n, satisfying the 
conditions of Theorem 7.1.1. Then the mean value of both the IG and DT character- 
istics of the excursion set A,(Y, I,) is given by 


(n-2)2 | A 2 
(7.1.11) u lA] E ae o fero(- i) 


2"?ng"D Gn) | a? 
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where o? is the variance of the component processes X, so that the variance of 
Y(t) is 2no*, and A is the usual covariance matrix of the first-order derivatives of the 
X'. Thus if A = (A), i,j = 1,2, 


(7.1.12) Ai RYO) 


U ^^ Ano? ^ 


Proof 


Both (7.1.12) and the fact that the variance of Y (t) is 2no* are straightforward 
consequences of (7.1.4) and the fact that RF(0) = R0) = 0, i = 1, 2. The fact 
that the mean IG and DT characteristics are identical follows as in the case for 
two-dimensional Gaussian fields, discussed in Section 5.4. Thus, in view of the 
preceding theorem, we need only show that (7.1.11) is equivalent to the follow- 
ing: 


-f f y2Y11605 0, y2; Y11) dy2 dy; 
y2> 0# -œ 


where ¢ is the joint density of (Y, Y;, Y2, Y, 1). We can rewrite this as 
(7.1.13) —E(Y3; (0Y,,(0] Y(O = u, YO) = 0} Cu, 0) 


where y* = max(0, y) and $ now denotes the joint density of Y(t) and Y,(t). 
We start by considering the conditional expectation in (7.1.13). Firstly, note 
that, from (7.1.5) and (7.1.6), 


(7.1.14) EY} Y |X = x, X = x, i= L...,n] 
= 4E{Z{Z,|X' = x, X1 = x} i= 1,...,n} 


where 
Z,- > x! 5, Z= Y [x i1 + GÀ]. 
i21 i-1 


From the basic properties of conditional Gaussian distributions and the 
correlation properties of Gaussian fields given by (2.4.9) and (2.4.10) we have, on 
setting V = a?E(| X1, |?) — A2,, that X} and Xi, have conditionally inde- 
pendent distributions and 

[X5] X1 = xi] ~ N( 2xh/Ai) AMA), 


[Ni v XL = xi] ~ NC- Au xf, Vio?) 
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Let u; and oj denote the mean and variance, respectively, of Z;, conditional on 
Xi = xi, X = xi, i = 1,...,n. Then by the above, (7.1.5), and (7.1.6) it follows 
that conditionally Z, and Z 2 have independent Gaussian distributions and 


My = Aid Y xxi, ei = |A|Ary Y (x')?, 
i= i=1 
(7.1.15) . ! 
m= DEAA + oh = Vo oy. 
i=1 i= 
Hence 
(7.1.16) 


E{Z}Z,|X' = x, X = xi i= 1,..., n} = E{ZĪ | XÍ, XB E(Z, X! xi) 
and by Lemma 5.3.3 and the above 


(7.1.17) 
2 
E(Zi|Xi = x, Xi = xij = aft — o|- 4) + Qx) a, exp( ~ a). 
0; 201 
We now change our conditioning to X = x', i = 1,...,n, and Y, = 0. To do 
this we transform X1,..., X1 to a new set of independent Gaussian variables 
Vi,..., V" via any orthonormal transformation consistent with 


yr i xix 
D. Q1 
Then, as are the X‘, the V! are independent N(0, 4, ,) variates satisfying 
Loy = Loy. 
i-1 i-1 


Under this transformation, conditioning on Y, = 0 and X' = x! implies that 
V! = 0 and y, = 0. Hence, from (7.1.17) we have 


E{Z+Z,|X' = x, Y, = 0) 


= Eve doin) mae? 5 (x!)? + y e» | 
i=1 i=2 


V! = 


= e ES (x)? + (n — Din 


Finally, we change the conditioning to Y = u, Y, = 0, to obtain from the above, 
(7.1.14), and (7.1.15) that 


(7.1.18) 


4utl2)A| 1/2 
E(YzYu|Y =u Y =O} = " | | 


que (2x) P-A ua’ 7? + (n — DA,]. 
11 


7.1 THE Y? FIELD 175 


To complete the proof we must now obtain an expression for the density 
$(u, 0) of (7.1.13). We start with the joint distribution of the X' and X}. To 
simplify the notation, set W! = X}, i = 1,...,n. Then 


(7.1.19) 
$(xl, ..., x", wl... W) = QnoA12)* enl - z|o x 2 (xi? + A] D e» 


Transform w!,...,w" to v',...,v" via any orthonormal transformation con- 
sistent with 


: yx ; xiwi 
~ DP, & Py? 1 œ] 


treating x!,...,x" as fixed constants. The Jacobian of this transformation is 
one, and )7_, (w9? = Y7., (o^, so that the joint density of X!,..., X", 
V,..., V" is identical to (7.1.19) with w! replaced by v. Now transform the 
wi to u' by setting ui = w'/[Y7. x°], i = 1, ..., n, to obtain 


$ó(xl,..., x" ul... U”) 


We now transform the X‘ by sending (X!,..., X") to 


(v - Y oy, XA... x") 


i=1 


v 


The new density is then 


$(y, x5. sa usus usum) 


n -1/2 - " 
= È EUM e»? (2noA12y1/2) 7" eap- ; Ir» s 2r- (u 4 
i=2 =a 


Aiiy 
where 


y»0 — oo <u « o6, i-2l...n, 
n-1 ; 1/2 
xe] y" o el< Dre T esses E =) e| 
i-2 
That is, the x‘ range over the sphere in 4" ^! of radius y!?. Thus 


(Y > -uY-0)- 1E fur. x? LO uuu. u") 


à 
2d ood dicc du". 
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where the factor of 4 results from noting that Y, — 2U!. Hence 


Te fae fof 


exp{— IE a ')? /uds 11} du 2. 


du" 
«eG FT 
E e "27 (2nud 1)" 2 
2(u!? - 2n Aly 
x IE fte — (x?)? — «+. — P] 1? dx? +. dx". 


By a transformation to polar coordinates the integral is easily evaluated as 
n? yu” 272/T(n/2). Combining this with (7.1.13) and (7.1.18) yields (7.1.11) 
and thus the theorem. 


Let us now see what further results can be obtained using Theorem 7.1.2. 
To start, we shall consider the behaviour of a y? field Y near the zero level. To 
simplify the discussion, we introduce the constant 


K, = |A|?" no" TGN], 


so that (7.1.11) becomes 


(120) — E(Q(AQ) = K,[u — (n — Do? Ju"? ap(- x) 


assuming that Y satisfies the conditions of the theorem. Thus 


— oo n=l, 
(7.1.21) lim E((4)) =4—K,07  n=2, 
wat 0 n>2. 


A little thought shows that this result tells us a great deal about the number of 
zeros of y? fields. For small, but non-zero, u it is clear that the excursion set A, 
consists essentially of the whole of J, including I, except for a few ‘holes’ where 
the field drops below the level u, of which there are approximately —(A,) 
(or —I(A,)) in number. As u tends to zero these ‘holes’ tend to the zero set of 
Y, i.e., {te Ij: Y(t) = 0). What (7.1.21) tells us is simply that, if n > 2, Y has, 
with probability one, no zeros. If n = 1 the expected number of zeros is infinite, 
while if n = 2 there are an almost surely finite number of points in the zero set of 
Y, and this number has the expectation Ka? = |A|' ?/(2z6?). 

It is interesting to note that this last result, for the case n = 2, seems to be 
unobtainable by other than this somewhat indirect method. For example, 
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Theorem 5.1.1, which provides the basic building block of our usual method of 
determining the mean number of points in a random set, is not applicable in this 
case since (5.1.5) is not satisfied. This is due to the fact that Y has a critical point 
at each of its zeros (cf. 7.1.5). 

Let us now turn from excursions at low levels to excursions at high levels. 
Since y? fields are built in a rather straightforward fashion from Gaussian 
fields one would hope that many of the high-level properties of Gaussian fields 
would be reflected in the properties of y? fields. However, it turns out that this 
does not seem to be the case. The main reason why Gaussian fields have a simple 
high-level structure is that the matrix of second-order derivatives (X;,,(t), 
i,j = 1,..., N, tends to be negative definite when X(t) is large. This follows, as 
discussed in Section 6.3, from the fact that, conditional on X(t) = u, the X;(t) 
have normal distributions with means proportionate to —u and variances 
independent of u, while the X;;(t), i  j, have zero means and fixed variances. 
This phenomenon is not mirrored by y? fields. For example, from (7.1.15) and the 
discussion leading up to (7.1.18), we see that the second-order partial derivative 
Y,, ofa y? field has, conditional on Y(t) = u,a mean at least proportional to —u 
but a variance proportional to +u. Thus, although in such a situation Y;, has a 
large negative mean, there is also a quite substantial probability associated with 
it taking positive values. This fact seems to preclude the possibility of repeating 
the analyses of the previous chapter for 7? fields and obtaining similar results. 

The simplest way to obtain information about a 7? field at high levels is to 
compare the mean DT characteristic of its excursion sets to that for a Gaussian 
field with the same mean and variance. To do this we must, as usual, restrict 
ourselves to the case N — 2. 

Let Y be a x? process on 2? with component processes X',..., X". Suppose 
Y, Y,, and Y, have unit variance and Y, and Y, are uncorrelated. Then (7.1.4) 
implies that each X? has variance o? = (2n) !/? and so Y has a mean of (1n)!?. 
Furthermore, by differentiation of (7.1.4) it follows from (2.4.7) that Xj and Xj, 
are uncorrelated if j # k, while each X has zero mean and variance (8n) !?. 
Thus, by Theorem 7.1.2 we have 


(7.1.22) 


(1- 2/2(gg)- 1/2 5 
EG. 1D) = sr censi Un = 0 Diew(- n 


yt 2/2 pln 25/4 u./2n 
= eroan Gn) Le 2n — (n — L)]Jexp{ — x} 


Now let X be a two-dimensional Gaussian field, with unit variance and the 
same mean as Y, i.e. (1n)! ?. Suppose, furthermore, that X, and X, have unit 
variance and are uncorrelated. Thus all the parameters of X that relate to the 
mean value of its excursion characteristies are identical to the corresponding 
ones of Y. But, by Theorem 5.3.1, 
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If we now consider these two expectations it is clear that, regardless of the 

value of n. 

(7.1.23) lim E(x(AY, I,))} > > lim E{y(A,(X, I,))}- 

This is at first rather surprising. Since X and Y were chosen to have identical 
variances, means, etc., one would initially expect similar behaviour of their 
excursion characteristics. It is possible, however, to give a simple, albeit heuristic, 
explanation of the observed difference. 

Suppose Y(t) = u. Then since Y(t) = 37., [X0]? and the X' are inde- 
pendent, the conditional distribution of the X'(t), given Y(t) = u, is uniform 
on the sphere (x e 2": ||x]| = Ju Thus, if u is large, it is just as likely that this is 
because one of the X' is close to ./u and the others are small as it is that all the 
X! take values of the order of Vuln. Consider the former case, and suppose that 
X? (t) is large. Then, being Gaussian, X! will exhibit the type of behaviour 
discussed in the preceding chapter, so that, with high probability, it will have 
only one local maximum in the neighbourhood of t. However, X?, ..., X" will 
not exhibit such behaviour and, in fact, will probably have a large number of 
local maxima and minima near t. Since Y is composed of a sum of all the X', 
its structure near t would most likely be a large dominating peak corresponding 
to X !, superimposed on which would be a number of small ‘bumps’ and ‘hollows’ 
corresponding to the maxima and minima of X?, .. ., X". A little thought shows 
that this would, in general, generate a larger DT (or IG) characteristic for 
A,(Y, I,) than for A,(X', I) and that this would lead, ultimately, to (7.1.23). 

To make this argument fully rigorous would require the development of a 
model process for x? fields of the type developed in Section 6.7 for Gaussian 
fields. The anticipated algebra of such a development is too forbidding for us to 
attempt this analysis here. Thus we shall conclude our study of y? fields at this 
point and turn toa closely related process, the envelope of a Gaussian field. 


72 SOME FURTHER SPECTRAL THEORY 


In order to properly define the envelope of a random field we need to firstly 
develop the spectral theory of homogeneous fields beyond the limits of Chapter 2. 
We start with a homogeneous, mean square continuous, complex-valued random 
field X(t), te ^, with zero mean and finite variance. By Theorem 2.12 the 
covariance function has a representation of the form 


(7.2.1) R(t) = [exp - 3) dF), 


while, by Theorem 2.4.1, X itself has the following representation as a mean 
square integral: 


(7.2.2) X(t) — f exp(it : A) dZ(À). 
*N 
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The random field Z has orthogonal increments and finite variance, and, for 
any interval I c 4", E(|Z(D)?) = F(). 

In Section 2.4 we saw that if X was real valued the spectral distribution 
function possessed certain symmetry properties, and (7.2.1) simplified to 


(7.2.3) R(t) = f cos(t - 4) dF(A). 
RN 


Let us now restrict ourselves to the case N = 2, since this simplifies the 
notation somewhat while not adding any restrictions to the analysis of the 
following section. Furthermore, let us assume that the spectral distribution 
function possesses a density, f (à). Then, since f(4) = f(—A) for all X e & it 
follows that we should be able to simplify (7.2.3) even further. To do so, let Zt), 
denote the half-plane {ù € #?: A, > 0} and define a function G(A) on 42,, by 
setting G(A,, 0) = 0 for all A, € (— oo, oo) and 


2f faro - 2 [ (500. A, <0, 
(724) 600-1 77... M" 
| 2f dra) - 2 f foa À, 2 0. 


o 


Then it is an immediate consequence of (7.2.3) that 


(7.2.5) R(t) = Í cos À -t dG(A). 
R 


1/2 


This half-spectral representation of R could have been taken over any of the 
four half-planes of #?, each one of which would yield, because of the symmetry 
of f, an equivalent representation. The function G is called the real form of the 
spectral distribution function and, of course, it possesses a density, g say. 
The importance of the representation through G rather than F is that it enables 
us to develop a representation for the process X itself which, unlike (7.2.2), relies 
only on real-valued fields. 

To develop this representation we define two new random fields, U(X) and 
VAA), à € Ri 2, by putting U(4,, 0) = V(A,, 0) = 0 for any 4, and, for A, > 0, 
setting 


UQ) = Z((0, 4]) + Z(—A, 0], 
V) = i[Z(0, X] — Z-A, 91]. 


where we write (v. A] for (v, 4,] x (v; A; J and we always evaluate Z((v, X ]) as 
Ziv) Zi, AL) ZO À 4 ZA). without any regard to possible orderings 
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among v and X. Now, for any v, à e R? there is an inverse relationship to (7.2.2) 
given by 
(7.2.6) 
(gsm vn e inner n e iam) 
Z((v, X] = i.m. f 3E X(t) dt. 
Qn Gs T 0 Eyl, 


This is the usual inversion formula for a non-random Fourier pair satisfying 
(7.2.2) (cf. 1.4.5) and it is straightforward to check that it also holds in this case. 
Then from the definitions of U and V and (7.2.6) it easily follows that 


(7.2.7) UQ) = 4x7? Lim. ial j 


T> œo 


x (cos à- t — cos A,t, — cos Azta + 1)(tit2) 1 X(0 dt, 
(7.2.8) 


T pT 
VQ) = in ?lim. f f (sin X - t + sin A,t, + sin A, t2)(t,t,) 1 X(t) dt. 
T^v J4—-TÀ/-T 

The existence of these mean square integrals is easily established as a conse- 
quence of the mean square continuity of X. 

Furthermore, from their definitions and the above relationships it follows 
that both U and V are real-valued fields with orthogonal increments. Also, if 
) € R? 2, it follows from the properties (2.4.3) of Z that 


(7.2.9) E{U(Q)} = E{V(A)} = 0, 

(7.2.10) E(U*Q)] = E(V?Q)) = GQ), 

and 

(7.7.11) E{U*(D} = E{V?(D} = GD, 

for all rectangles I c 22, , while for all disjoint I, and J, in 23,3, 
(7.2.12) E(UGQUG5)) = E(VU )V5)) = 0. 


Taken together, U and V arecalled the real form of the spectral process associated 
with X, and we have the following result, which contains the representation we 
have been seeking. 


Theorem 7.2.1 


Let X be a real-valued, mean square continuous, homogeneous random field on 
R*. Then, if U and V are defined by (7.2.7) and. (7.2.8), X has the following 
representation as a mean square integral: 


(7.2.13) xe - | cos -t dUQ) + f sin A-tdV(A). 
Rin i 


vain 


U and V are real-valued; orthogonal increment fields satisfying (7.2.9) to (7.2.12). 
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This is the main tool we need to define the envelope of a random field. The 
assumption that F (and thus G) possesses a density is essentially irrelevant to 
this result and only has the effect of simplifying the algebra of both its proof and 
statement. However, since the simplification is substantial we shall continue 
with the assumption. For a development of a half-spectral theory in one di- 
mension without this assumption see Cramér and Leadbetter (1967). 


7.3 THE ENVELOPE OF A HOMOGENEOUS FIELD 


If A(t) denotes a non-random function on Z" oscillating about the level zero 
it is often of value to find a non-negative function B(t) of a simpler form such that 
| A(t)| x B(t) for all t, and | A(t)| = B(t) for some t. A function B(t) with these 
properties is called an envelope of the function A(t) and has a number of uses in 
practical situations. In particular, if B is chosen in such a way that | A(t)| = B®) 
usually implies that t is a local maximum of A, then B can often be considered to 
be an approximating function to A, at least at high levels of A. Furthermore, B 
is generally chosen to be somewhat smoother than A, so that while it follows, at 
high levels, the general shape of A it does not have the same quantity of small- 
scale perturbation. 

When we turn to homogeneous random fields X(t) on 4" an envelope can be 
reasonably simply derived from the field’s spectral decomposition, given in 
Theorem 2.4.1. Essentially what we seek as an envelope of X(t) is a non-negative 
field, Y(t) say, for which | X(t)| x Y(t) for all t, with equality for some t. Such a 
field can be obtained if its spectral decomposition is similar to that of X at low 
frequencies (i.e. 4 for which |A|| is small) and it has no, or very weak, components 
at high frequencies. To develop this idea properly we shall restrict our attention 
to two dimensions, and follow the construction of Adler (1978b). 

Let X(t), t€ Z?, be a real-valued, mean square continuous, homogeneous 
random field. Then it has the full-spectral representation (2.4.2) and the half- 
spectral representation (7.2.13). From these we define a new, complex-valued 
field, called the complex field corresponding to X, by the half-spectral repre- 
sentation 


(7.3.1) X*(t) — af exp(i4 - t) dZ(A). 
Ri 
We call the real-valued field, X(t), defined as follows, the Hilbert transform of 
X(t): 
(7.3.2) X*(t) = X(t) + iX(t). 
It follows from these definitions and (7.2.13) that € has the representation 


(7.313) ` L(t) [ 


E i 


sin A: tdU(A) — [ , cos kt dV (A) 
$ as EP 
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with U and V as in (7.2.13). It is immediate from this representation that £ and 
X have the same covariance function, R(t), and the same ‘real form’ spectral 
distribution function, G(A), with density g(X). Furthermore, the cross-covariance 


R*(t) = E{X(s)X(s + 0) 
is, in view of (7.2.9) to (7.2.12), given by 


(7.3.4) R*(t) = Í sin à - t dG(À), 
Rin 

implying 

(7.3.5) R*(—t) = —R*(t). 


We now define the envelope Y(t) of X(t) to be the modulus of the complex 
field X*(t). That is, 


(7.3.6) Y(t) = |X*(0| = [X?(t) + LO. 


Clearly Y(t) possesses the property that | X(t)| x Y(t), with equality for those 
t for which X(t) = 0. 

As in one dimension (cf. Cramér and Leadbetter, 1967, sec. 14.2), the concept 
of an envelope has little practical significance unless the real form of the spectral 
density, g(À), has most of its mass concentrated in a relatively small neighbour- 
hood of some particular value of à € fia, @ say. Hereafter this will be assumed 
to be the case. Then, since by (7.3.1) we can write 


X*(t) = exp(io -t)-2 js exp[i(A — @)-t] dZ(4), 


it is clear that we can express X *(t) as the product of a carrier surface, exp(ia - t), 
and a slowly varying amplitude function whose modulus is the envelope Y(t). 
This is the primary practical motivation of the definition (7.3.6) of the envelope 
of a random field. 

We can now investigate the excursion sets of Y, as in the following theorem. 


Theorem 7.3.1 


Let X(t) be a real-valued, zero-mean, homogeneous Gaussian field on R? 
satisfying the conditions of Theorem 5.2.2 for N — 2. Furthermore, let it have a 
spectral density and let the real form of this density, gA), be symmetric about some 
point € of R?,2. If Y(t) is the envelope field defined by (7.3.6) then 


(7.3.7) 
E(X(AY, 1) = EXT(AAY, 1j 


1,2 

3 - , lu 
= Qn) '(m$o T mi) ^ msg u*(1 — fion oe (- mo ) 
00 
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where 


(7.38) my [Gs ~ o); — og) d 


1/2 


Proof 


We start by introducing two new fields, W, (t) and W,(t), which are the real and 
imaginary parts of the field exp( — io - t)X *(t); i.e. 


W(t) + iW, = exp( — io - £)X*(0). 
Then, according to (7.3.6), 
(7.39) Y(t) = X*W] = [W10 + WO]. 


We claim W, and W, are independent Gaussian processes. The fact that they are 
Gaussian follows from the fact that Z(t) is a complex Gaussian field. (This is an 
immediate consequence of the inverse spectral formula, 7.2.6.) To check that 
W, and W, are independent, we need only show that W,(s) and W,(s + t) are 
uncorrelated for each s and t. But, applying (7.3.5) and (7.3.4), 


E{W,(s)W,(s + t)} = E([X(S)cos o -s + X(s)sin o. s] 
x [X(s + t)cos o - (s + t) — X(s + tsin o - (s + t)]} 
= — R(t)sin o. t + R*(t)cos@ -t 


= Í sin(à — @)-t g(4) da 
D 
which is zero from the assumed symmetry of g. 

Thus by (7.3.8) we have that the field Y?(t) is simply the sum of two, inde- 
pendent, Gaussian fields, and so is a y? field with parameter two. The excursion 
set of Y above the level u is simply the excursion set of Y? above u?, and we can 
use Theorem 7.1.2 to evaluate the mean value of the excursion characteristics 
in this case. The conditions of the current theorem certainly imply that those of 
Theorem 7.1.2 are satisfied, so that all we need to do to use that result is to 
evaluate the appropriate spectral moments. But these are simply the mij; of 
(7.3.7), since both W, and W, are easily seen to have the spectral density g(A). 
Substitution into (7.1.11) then produces (7.3.6); and we are done. 


In conclusion we note that all that was derived in Section 7.1 about y? fields 
from a knowledge of their mean excursion characteristics applies equally well 
to the envelope of a Gaussian field simply by noting that the square of the 
envelope is a 7^ field with parameter two. 


CHAPTER 8 


Sample Function Erraticism and Hausdorff 
Dimension 


In the preceding chapters we have always assumed that the random fields 
we were studying satisfied a large number of regularity conditions. In particular, 
we have almost always assumed that the sample functions of these fields were 
continuous with probability one, as were their first- and second-order partial 
derivatives. Such assumptions are not always justifiable, however, and in this 
chapter we shall investigate what happens when some of these regularity 
conditions are no longer satisfied. To try to give the flavour of what is about to 
come we shall firstly consider a particular example. 

Without doubt the most widely studied, and perhaps most important, of all 
Gaussian processes on 4?! is the Wiener process, or Brownian motion. This is 
simply a Gaussian process, W(t), t > 0, with zero mean and covariance given by 


(8.0.1) E(W(sSW(t)) -s^t-—i(t-cs-]lt— sl) 


where s ^ t — min(s, t). It is straightforward to check from the theorems of 
Chapter 3 that the Brownian motion possesses almost surely continuous sample 
paths. However, as will become clear later on, these paths are non-differentiable 
at almost every t. Thus any attempt to describe the level crossing behaviour 
of a Brownian motion via a methodology reliant on the existence of sample 
path derivatives of any order is immediately doomed to failure. To see what this 
means for random fields, let us consider a two-parameter generalization of W(t) 
known as the planar Wiener process, or Brownian sheet. This is a zero-mean 
Gaussian field W(t) on 47. = (te 2?:t,, t, > 0} with the covariance structure 


E{W(s)W(} = (s, ^ ti) 2 ^ t2). 


We shall investigate the properties of this field in some detail later on (particularly 
in Section 8.9), but for the moment consider Figure 8.0.1, which depicts a 
simulated example of W(t), t € (0, 1]?. In this example only those sections of W 
for which W(t) > 0 are shown, and the negative sections of W have been re- 
placed by parts of a plane. From this example the origin of the nomenclature 
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‘Brownian sheet’ should be clear. Since (8.0.1) implies that W(t) = 0 if t lies on 
one of the axes in #*, the sample functions resemble a roughly shaken bedsheet 
tied down on two adjacent sides. (This analogy was originally drawn by Pyke, 
1972.) 

Suppose now that we wish to apply the analysis of Chapters 6 and 7 to this 
process; i.e. we wish to study its excursion sets. If we write, for 0 < x < 1, 


L, = (te(0, x] x (0, x]: W(t) 20), 


then Figure 8.0.2(a) depicts the level set L,. This figure would seem to indicate 
that the excursion characteristics of Ao(W, 1,) are well defined, so that we can 
proceed with the desired analysis. 


Figure 8.0.2 The zero level curves of the Brownian sheet on (a) the unit square, 
(b) (0. 4) x (0, 4). (c) (0, 1) x (0, 1). (Reproduced from Adler, 1978d. by permission of 
the Applied Probability Trust) 
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However, Figure 8.0.2(b) seems to indicate that neither A,(W, 1.) nor its 
excursion characteristics are as well defined as we might hope, for here is 
depicted again the lower left-hand corner of Figure 8.0.2(a), i.e. L,;2, magnified 
by a factor of two. However, whereas L, was obtained by observing the sheet 
on a square grid of points separated by a distance of 1/64, the distance between 
points used to obtain (b) was only 1/128. A brief glance shows that this magnifi- 
cation of L,,; indicates a much higher degree of complexity than that visible in 
the original diagram. In Figure 8.0.2(c), L4 ;4 is shown, again magnified, and this 
time the gap in the underlying grid is only 1/256. The additional complexity of 
L,;4is even more noticeable here. One could continue in this fashion indefinitely, 
and on each occasion it would be necessary to revise our estimates of the 
excursion characteristics of Ao(W, I,). This is quite a different type of phenome- 
non to that discussed in Section 5.5, where, under regularity conditions not 
satisfied by the Brownian sheet, we saw that after a while taking finer and finer 
grids, as above, yielded no change in the excursion characteristics of Ao(W, I,). 

The import of all this is that the methodology of excursion characteristics 
is not applicable when the usually assumed regularity conditions are not 
satisfied. Thus, if we still wish to study the sample function properties of random 
fields either through their excursion sets or the manifolds which bound them, 
we must turn to a quite different methodology. The appropriate tool turns out 
to be a very old one, known as the Hausdorff dimension, which dates back to 
Hausdorff (1919). This is discussed, in some detail, in the following section. 
Following this discussion we introduce a particular class of Gaussian random 
fields, known as index-f fields, which do not satisfy the regularity conditions we 
have generally required up to now. After an investigation of some of the simpler 
aspects of these fields, we proceed to use the concept of the Hausdorff dimension 
to investigate some of their more esoteric sample function properties. This is 
continued in more detail and generality in the final sections ofthe chapter. 

We have attempted to make a fairly distinct division between the easier and 
more difficult aspects of the forthcoming analysis. In the first five sections we 
shall consider the more straightforward problems, and the treatment here is 
essentially self-contained. The remaining sections cover reasonably difficult 
ground and assume a higher level of mathematical sophistication of the reader. 
Throughout the chapter a familiarity with the basics of measure theory is 
assumed. 

Finally, a comment on the title of this chapter is probably called for. When 
Gaussian fields do not possess smooth sample functions these functions are 
generally non-smooth in the extreme, i.e. they are inordinately erratic. The 
methodology of the Hausdorff dimension is useful in quantifying exactly how 
erratic a given function is. While we were searching for a term to describe the 
property of being erratic, Professor Joe Horowitz suggested (primarily in jest) 
the term erraticism. Since this term has met with general approval whenever 
it has been used, this seems to us to be an appropriate place to launch it into the 
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probabilistic literature and the English language as a description of the property 
of being erratic. 


81 HAUSDORFF DIMENSION 


The concept of dimension is implicit in the most fundamental mathematics, 
and certainly has an important role to play in the study of random fields insofar 
as we use it to characterize the parameter space of our processes, Experience tells 
us that a finite set of points has dimension zero, a straight line has dimension 
one, a filled in square dimension two, and so forth. There are situations, however, 
in which the limitation of using only integers to describe the dimension of some 
object is unnecessarily restrictive. To see why this might be so let us look at a 
simple set in 2’. 

The set we shall consider is known as the Cantor ternary set and is readily 
constructed in a sequence of steps, as follows. We start with the interval (0, 1) 
and as a first step delete the middle third [4, 3]. Each of the remaining two 
pieces (i.e. (0, 1) and (3, 1)) then has its middle third removed, leaving four 
pieces, from which the middle thirds are deleted, and so on, ad infinitum. The 
resulting set has some rather strange properties. It can easily be seen (and is a 
consequence of what is shown below) that it has zero Lebesgue measure and so 
in the sense of "length' it is a trivial subset of (0, 1). On the other hand, it is just 
as easily seen to consist of precisely those numbers in (0, 1) which, when ex- 
pressed in a decimal expansion to the base 3, contain only 0’s and 2’s in their 
expansion. Considered in this light we see that any number x € (0, 1) can be 
identified with a unique number y in the Cantor ternary set simply by taking its 
expansion x = xX, X3 --- tothe base 2, and making it correspond to the number 
y with expansion y = y,)2y3--- to the base 3 by setting 


xey iff x; -0-—y 20 and x;cley-2i-L2... 


But this correspondence indicates that there are effectively as many points in 
the Cantor set as there are in (0, 1), despite the fact that the Cantor set has zero 
Lebesgue measure! 

The solution to this apparent paradox lies in realizing that (N-dimensional) 
Lebesgue measure is not always the appropriate tool for measuring the size 
of sets, especially when, like the Cantor set, they are somewhat erratic. A larger 
class of measures, which enables us to come to grips with this and similar 
paradoxes, are the so-called «-dimensional measures, which we shall now con- 
struct. 

Let A be a subset of A~. The a-dimensional outer measure S,(A) is defined for 
positive « as follows. Let 


(8.1.1) S, (A) = inf Y (diameter B,)’, 
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where, for each ¢ > 0, the infimum is taken over all collections of open spheres 
{B;} in Z" whose union covers A and for which the diameter of each B; is not 
greater than £. Such a collection will be called an & covering. The sum is taken 
over each particular collection {B;}. 

As e decreases the infimum in (8.1.1) extends over smaller and smaller classes 
of coverings and hence S, (A) increases, or at least does not decrease. Therefore 
the limit (finite or infinite) 


(8.1.2) SA) = lim S, (A) 
él0 
exists. 


Clearly $,(-) is monotone in the sense that S,(A) < S,(B) if A c B. Further- 
more, S, is subadditive; i.e for any sequence of sets A,, A3, . .. in BY, 


(J An) < 2; S A,). 
To see this, fix £, ô > O and choose e coverings {A,,;} such that 
Y, (diam A, € S,,{A,) + 6-27" < S(A,) + 6-27". 
All the spheres together form an e covering of Us A, with È; , (diam A, < 
Y, S(A,) + 6. Letting ô | 0 establishes the subadditivity of S,- 
The Hausdorff dimension of a set A can now be defined by considering the 


behaviour of S,(A) not as a function of A but as a function of x. We start with the 
following lemma. 


Lemma 8.1.1 


For any set A c 9" there exists a unique number a*, called the Hausdorff 
dimension of A, for which 


(8.1.3) a < a* => S,(A) — oo, a> a* = $(A) — 0. 
This number is denoted by dim A and satisfies 
(8.1.4) a* = dim A = sup{a:8,(A) = oo] = inf{a: S4) = 0}. 


Proof 


We start by showing that if S,(A) is finite then S,(A) = 0 for any f > a. 
Let {A;} bean £ covering of A for which 


X(diam A;)* < S,, (A) + | < S(A) 4+ |= K « c. 
Then 


Sai dam 4, «5A "Midiam A, eK, 
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Since f > a we have, on letting e | 0, that $,(A) = 0 as claimed. Thus if $,(A) 
is finite for some particular value of « then it is zero for all larger values. There- 
fore, there exists a transition point, «* say, satisfying (8.1.3), and definition 
(8.1.4) makes sense. 


Note that we have no information about S,(A) when « = dim A. It could be 
Zero, positive and finite, or infinite. To obtain more precise information of this 
nature it is generally necessary to make use of the more refined notion of 
measure functions, a somewhat more esoteric concept that we shall not introduce 
here. á 

There is nothing in the definition of the Hausdorff dimension that guarantees 
that the dimension of a set need be integral. This is precisely where the im- 
portance of this concept lies, for it allows us to make reasonably fine differentia- 
tions between sets of varying degrees of erraticism. However, the Hausdorff 
dimension of a ‘simple’, non-erratic set always turns out to be integral and yields 
the integer one would intuitively choose as the dimension of the set. As an 
example of this, we shall look at a simple example (given in Billingsley, 1965) 
which shows that the dimension of a reasonably smooth surface in 4? is two. 

Suppose A is defined by z = F(t), te I, c 4?. Without any further assump- 
tions at all we can prove dim A > 2. To see this, suppose A is covered by 
spheres A, of diameter d;. If p represents vertical projection of sets in 4? onto 
the (t4, t2) plane, then the pA; cover pA. Since pA, is a disc of diameter d; (area 
1d? /4) and pA is I, (area 1) it follows that Z;zd2/4 > 1, or Z(diam Aj > 4/n. 
Since this is true for any covering {A;} we have from (8.1.2) that S,(A) > O and 
so by (8.1.4) dim A > 2. 

To prove the opposite inequality we need to assume a specific smoothness 
condition for F, viz. @p(ô) = 0(6), where œp is the usual modulus of continuity 
of F determined by 
(8.1.5) wp(ð) = sup{| F(s) — F(t)|:s, te L,, |s —t|| < ô}. 

Such a condition would be satisfied, for example, by an F with bounded partial 
derivatives. We shall prove that dim A < 2 by showing S,,,(A) = 0 for any 
n > 0. Choose K > 1 so that, for small enough ô > 0, œp(ô) < Kô. Now split 
I, into n? small squares of side 1/n. Since, if n is large enough, the variation of F 


on one of these small squares is less than NE K/n, that part of the surface of A 
lying over a small square can be enclosed in a cube of side 2 K fn, which in turn 
can be enclosed in a sphere of diameter J6K /n. Thus A can be covered by n? 
spheres of diameter /6 K/n. For this covering X(diam A;)?*" = (/6 Ky? n", 
Now fixe > 0 and choose n such that / 6 Kn « &, Then 


So 4n,4A) < (EKP *"n7", 
so that on letting n > »: we have $5,, (4) = 0, implying 5;,,(4) = 0 and 
dim A < 2. 
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This sort of argument (which we shall use again in proving Lemma 82.1, 
albeit in a more involved framework) shows that reasonable sets get the di- 
mensions they should. In general, however, our interest will not be with these 
sorts of sets, so let us look again at a highly erratic set, as in the following 
theorem. 


Theorem 8.1.2 
The Cantor ternary set has Hausdorff dimension log 2/log 3 — 0.6309 .... 


Proof 


Half the proof is easy. Let C denote the Cantor set. For each n 7 1 it follows 
from the method of construction that C can be covered by 2" open sets of 
diameter 3^", where n > 0 is arbitrary. Thus 

SAC) x lim 27.3 
But if « = log 2/log 3 we have 2” - 3 "* = 1, so that S,(C) < 1. Equation (8.1.4) 
now implies that dim C < log 2/log 3. 

The second half of the proof, i.e. obtaining the reverse inequality, is somewhat 
more involved and, essentially, involves showing that S,(C) > lif = log 2/log 3. 
This implies dim (C) = log 2/log 3, as required. We shall not give the details of 
this, but refer the interested reader to the original paper of Hausdorff (1919, 
pp. 169-172). 


Theorem 8.1.2 indicates that the Hausdorff dimension plays, at least as far 
as the Cantor set is concerned, the discriminating role we had planned for it. 
That it also applies to the more general problem of sample function erraticism 
will become clear in what follows. At this point the reader who is interested only 
in the definition of the Hausdorff dimension and its application to sample 
function theory can turn to Section 8.3. The following section is devoted to a 
development of the tools needed for this application. For generalizations of the 
above theorem to other subsets of [0, 1] specified in terms of properties of such 
things as their continued fraction expansions see the monographs of Billingsley 
(1965) and Rogers (1970) and the references contained therein. Billingsley gives 
an interesting proof of (a more general result than) Theorem 8.1.1 based on 
probabilistic and entropy arguments. 


8.2 SOME USEFUL LEMMAS 

In what is to come, we shall be using the concept of Hausdorlf dimension to 
investigate sample function erraticism by determining the dimension of sets 
such as the graph, range, and zero set, generated by these functions. [t wall 
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invariably be the case that the determination of these dimensions will be under- 
taken in two quite distinct stages. Firstly, we shall argue, although the functions 
in question are quite erratic there is usually a natural limit to their erraticism, 
generally expressible as some sort of Hólder condition (defined below). Given 
such a limit reasonably straightforward arguments can be used to establish an 
upper bound to the dimension (and so the erraticism) of the sets in question. 
Most of the tools for this part of the argument are set up in the first two lemmas 
of this section. The second part of the argument rests on showing that the 
previously established upper bounds also serve as lower bounds, and thus are 
actually the dimension. This part relies on results from capacity theory, which we 
shall develop later in the section. 

Although our primary interest has always been with real-valued random 
fields, we have often encountered fields which take their values in a multi- 
dimensional space. Such a field, for example, was the Z4"-valued 


X —(X —u, X,,..., Xy.) 


of Chapter 5, which we needed to study to determine properties of the excursion 
sets of the N-parameter, real-valued field X(t). In Section 5 of this chapter 
we shall use the methodology we are about to develop to re-examine certain 
properties of the vector-valued field X when the smoothness conditions of 
Chapter 5 are not satisfied. So that we can do this, we shall, throughout the 
remainder of this chapter, study functions on 2", taking values in Z4, where 
N > l1 and d > 1. Fortunately, working at this level of generality has only a 
minimal effect on the difficulty of the mathematics, while very often producing 
far more interesting results than those existing when d — 1. 

Thus we shall now write F to denote an Z^-valued function defined on the 
unit cube, J,,in Æ”. At the risk of some confusion with our previous notation, 
where F; denoted the partial derivative of a real-valued F(t) with respect to t;, 
we now write F; to denote the ith component function of F; i.e. F = (F,,..., Fa) 
With this notation, we can introduce the following smoothness conditions on F. 


Theorem 8.2.1 


A function F : 4?" — R’ is said to satisfy a uniform Lipschitz, or Hólder, con- 
dition of order & = (a,,...,&4), 0 < a; < 1, on the cube I, if there is a finite 
constant A such that, for every t, t + he I,, and ||h|| small enough, 


(8.2.1) |F (t) — F(t- | x Alb”, — forjo L....d. 


We shall always use the term 'Hólder condition’ rather than *Lipschitz 
condition’, the former representing the more common usage in sample function 
theory. Our first result is as follows. 
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Lemma 8.2.1 


Let F: 4?" — 4 satisfy a Hólder condition of order a on 1,. Then writing the 
graph and image of F as 
Gr F = {(t, F(t): te I) c Anti 
Im F = (F(0:te1,) c #4, 


and assuming that 


(8.2.2) 0<a,<a% <---<uy<1 
we have 
d — Q: 
(8.2.3) dim(Im F) < mil n x =! (s a 
d 


(824) —dim(Gr F) < min |^ T D Qa 2) Na » a a|; 


Firstly, note that (8.2.2) is not a real restriction, since we can always renumber 
the F, to ensure that this condition holds. We have actually already established 
a special case of (8.2.4) in the preceding section, where we showed that if N — 2, 

= 1,« = 1, then the graph of a planar function satisfying the above conditions 
had dimension at most two. (See the example based on 8.1.5.) The following 
proof of Theorem 8.2.1 is a relatively straightforward extension of that we used 
for this special case. 

As far as we are aware, the above result is almost classical for N = d = 1, 
although the standard reference given for it in the more recent literature is 
Kahane (1968). For general N and d, and a, = --- = o4, it is due to Yoder 
(1975), and in its current generality to Cuzick (1978), whose proof we follow. 


Proof of Theorem 8.2.1 


We start with (8.2.3). Clearly dim(Im F) < d. Split I, into 2"" small cubes of 
edge 2^". By the Hólder condition the image of each of these cubes is contained 
in a d-dimensional interval, the length of whose edges is proportional to 
2°", j = 1,...,d. Each of these intervals can be divided into 2"*!6«-*2 cubes 
with edge length proportional to 2 ""d. Thus, from its very definition, the 
dimension of the image is no more than the infimum of all A for which 


QnN , Qnzf- s(aa- a) , 3-7 Ama _, () as n > oo. 
But this is true for all 


N+ ye. 1 (Xa — 0) 
Xa 


À 


so that we have established (8.2.3). 
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To prove (8.2.4) we firstly note that proceeding as above we can cover Gr F by 
2"" intervals of dimension N + d, N of whose edges have the length 2^", with 
the remaining edges proportional to 2^"; j = 1,..., d. The latter sides are, 
of course, the longer ones. Thus we can cover each of these intervals by 


2n Ef- ı laa- ai) 
cubes of edge length 2^ "^4 so that, proceeding as before, we find 


N + $f- (a — 9) 


dim(Gr F) < 
Xa 


On the other hand, these same 2"" intervals can also be covered by 


gn dfs 11-2) 


cubes of edge 27”, so that the same argument now yields 


d 
dim(Gr F) x N+ Y (1— 4). 
i=1 


Combining this with the previous inequality establishes the inequality (8.2.4), 
and we are done. 


The upper bounds of this lemma will later turn out to be very useful. In the 
following lemma we obtain similar bounds on the dimension of the level 
sets of F, i.e. the sets of the form 


F !(u)- {t e 1,: F(t) = uj, 


for some ue Z7. This result will turn out not to be quite as useful, for it yields 
results only for almost every level u, and so, as far as is known, there may well be 
some (unspecified) values of u for which the given bounds are not valid. If we 
are interested in only one specific level (u — 0, for example) we cannot apply 
the result of this lemma to that level. 

Lemma 8.2.2 was first proven for general d, N = 1 and à, =--- = o4, in 
Kahane (1968). Adler (1977b) treats general N. Cuzick (1978) states the version 
given here, but without giving a proof. By ‘almost every’ we, obviously, mean 
with respect to Lebesgue measure in Z^, i.e. the bound fails for, at most, a u set 
with A, measure zero. 


Lemma 8.2.2 


Let F: A“ > R? satisfy a Hólder condition of order a on 1,. Then if N — 
yd. , a, > 0,dim(F~ '(u)) < N — 34 a; for almost every u in A". 
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Proof 


As usual, we start by splitting I, into 2"" cubes of edge 2^". Call these cubes 
Baii = 1..., 2". Let G, denote the union of the (N + d)-dimensional intervals 
Bni x F(B,;). Then the graph of F is contained in G, for every n. Furthermore, 
using the Hólder condition on F we have that the Lebesgue measure of G, in 
AN*4 is Q(27"2-:9), Given ue Z^, let E,(u) denote the union of cubes B, 
for which u € F(B,,). This set clearly contains F~ !(u) for every n. Then 


Í Ay(E,(u)) du = Ay, (G,) = O(2- "EF 1%), 
Ra 
Thus, given s > 0, we have 


Y 2-n6-Xf-ia) f Ax(E,(u)) du < œ, 
A gå 


n= 


implying 


2,27 Te HAE (u)) < oo 
=1 


n 


for A, almost every u. Thus 
Ay(E,(u)) = O(2~ EF 12-2) 


for almost every u. But since E,(u) is made up of a number, N,(u) say, of cubes of 
side 2^", this tells us that, for almost every u, N,(u) = O(2"" 2% :*:* 9), Using 
this and the fact that N — $2. , o; > 0 it follows that the dimension of E,(u) 
and hence of F^ (u) is at most N — 2. o; + e for almost every u, which, on 
letting ¢ | 0, proves the lemma. 


These two lemmas will, for the moment, be sufficient to establish upper bounds 
on the dimensions of the random sets we shall wish to study. To derive lower 
bounds we need to turn to the notion of capacity, developed as follows. 

Let A be a compact subset of Z” and u a countably additive measure defined 
on 4P, the Borel sets of Z~, with support A and total mass 1. That is, 


U 4) = Xu) 


(8.2.5) Aj, Ass... disjoint = p( 


(8.2.6) H(A) = WA) = 1. 


For « > 0 define the energy integral 


"77 Wr NER 
Jag Ix y|! 
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Let 
WA) = inf Lu), 


where the infimum is taken over all positive measures p satisfying (8.2.5) and 
(8.2.6). Then we have the following definition. 


Definition 8.2.1 
The « capacity of A, denoted by C(A), is defined by 
— l/a ` 
cqa- [WAE EWA < 00, 
0 otherwise. 


It is immediate from this definition and the compactness of A that if C,(A) = 0 
then C,(A) = 0 whenever f > a, while C,(A) > 0 implies C,(A) > 0 for all 
B < a. This rather simple fact has the following lemma as a consequence. The 
lemma goes back tó Frostman (1935), although the proof we give is essentially 
a modification of one due to Carleson (1957), as suggested by Taylor (1961). 


Lemma 8.2.3 


Let A be a compact subset of A~. Suppose there exists a positive measure u 
satisfying (8.2.5) and (8.2.6) and a finite « such that the energy integrals Ig(u) are 
finite for all B < a. Then dim(A) > a. 


Proof 
Let 6 > Oandset 8 = a — 28. Then I,(u) < 9o0,asis I5. (4), i.e. 


f [ix — yj 9*? d(x) duty) < oo. 


From this it follows that there must be a closed subset, A, say, of A such that A, 
has positive u measure and 


Í Ix — y|- 9*9 duty) < M 
A 


for some finite M and all x e A,. Let e, be a sequence decreasing to zero, and 
for each n let S$,,,... , Spm, be open spheres of diameters at most e, which cover 
A,. Choose a point x,; from each $,; ^ A,. Then 


[diam $,,| P+” u(S,) < Í [Xni — yl. “t” duty) < M. 
Ay 
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Thus 


0< MA) < Y Sn) < M Y. diam Spil’, 


i=1 i-1 


from which we have 


l 


my mn 
Y [diam Sui? = lel Y, |diam Suil? 
=1 i=1 


Mi) 
~ Mle? 


which diverges as n > oo. Hence dim(A,) > fj = a — 20. Letting ó | 0 and 
noting that since A, < A we have dim(A) > dim(A,) completes the proof. 


There exist much stronger results than Lemma 8.2.3 relating the capacity of a 
set of its Hausdorff dimension. For example, there exists the notion of the 
capacity dimension of a set, C-dim (A), defined by 


0 if CA) = 0 for all æ > 0, 


C-dim(A) = d if C,(A) = 0, « >d, 
CfA) > 0,0<a<d. 


It can be shown that a compact A c Z" has identical capacity and Hausdorff 
dimensions. Lemma 8.2.3 actually shows that dim(A) > C-dim(A). However, 
the proofof the reverse inequality (due to Kametani, 1944, 1953; see also Carleson, 
1950, and Taylor, 1961) is substantially more complex, and as we shall not need 
to use this equivalence of dimensions we shall not bother with the proof here. 

The following lemma provides the remaining two tools needed for the forth- 
coming analysis. 


Lemma 8.2.4 


Let F: 4" — 9f! be continuous on 1,. Then 


(8.2.7) dim(Im F) > « it f Í | F(t) — F(s)||7* dt ds < œ, 
Io¥ Io 


(8.2.8) dim(Gr F) > « it f Í I(t, F(t) — (s, F(s)|| * dt ds < œ, 
L,J I, 
where (t, F(t)) = (t... ty, Eilt), Ft) E BX Te 


Proof 


From the previous lemma, to prove (8.2.7) we need only exhibit a countably 
additive. positive mease, say. supported by Eni F for which the corresponding 
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energy integral 1,(u) is finite. To do so we simply take the image of Lebesgue 
measure in J, under the transformation t —^ F(t). Then u(Im F) = (2°) = 
AT) = 1, and 


Í Í ix — yl -*duGo dyly) = Í Í |F(t) — F(s)j -* dt ds 
Im FY ImF Ios Io 


which is finite by assumption. Hence dim(Im F) > aas required. 
To prove (8.2.8) use the same argument with the transformation t — (t, F(t). 


8.3 INDEX-B GAUSSIAN FIELDS 


It is immediately apparent from Lemmas 8.2.1 and 8.2.2 that the level of 
erraticism exhibited by a non-random function is intimately related to the 
local growth of the function as measured through Hólder conditions. Thus, in 
order to study the erraticism of a random field, it will be necessary to isolate 
some aspect of the fi-di distributions of the field which determines whether or not 
its sample functions will satisfy a Hólder condition of some particular order. 
If we restrict our attention to Gaussian fields with zero mean, it is reasonable to 
expect, from the analysis of Chapter 3 (especially Theorem 3.3.2), that in this 
case it will be the behaviour of some multidimensional generalization of the 
incremental variance function (3.3.5) that will be the determining aspect. 

To formalize this, let us write X(t) = (X,(t,..., X,(t)) to denote an °- 
valued Gaussian field on 1, c 4". That is, X is an (N, d) Gaussian field. If we 
assume, as we now shall, that X has mean 0, it follows that all of its stochastic 
properties are determined by the covariances E{X,(t)X (s)), i, j = 1,...,d. 
However, we shall see in the following theorem that it is only a subset of these 
covariances that determines the existence, or otherwise, of Holder conditions. 
In fact, the determining factors turn out to be the d incremental variance 
functions 


(8.3.1) ois t = E(IX(S — X0), — fori 1,...,d. 


To further simplify our study, we shall assume from now on that the co- 
ordinate fields X,,..., X, have homogeneous increments. By this we mean that 
each o,(s, t) is a function ofs — t only, so that we can write 


(8.3.2) oit = E(IX(s--0— X(SI2), — fori L...,d. 


The following theorem, due to Sirao and Watanabe (1970) for N = d= 1, 
shows that the local growth of the sample functions is determined by the be- 
haviour of the c; near t = 0. Furthermore, this theorem indicates the existence 
of a special type of dichotomous behaviour often exhibited by Gaussian pro- 
cesses—either, with probability one, all sample functions possess a certain 
property or none do. (This seems to have been first noted by Belyaev, 1961, 
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who proved that for homogeneous Gaussian processes on 2?! one of the follow- 
ing alternatives takes place: either all sample functions are continuous or all 
sample functions are unbounded in every interval of finite length. For more 
recent extensions of this idea to more involved properties, see, for example, 
Cambanis and Rajput, 1973, Jain, 1971, and Kallianpur, 1970.) 


Theorem 8.3.1 


Let $ be a continuous function defined on (0, 1) and X a zero-mean, homo- 
geneous, real-valued Gaussian field on 9£", with a continuous covariance function. 
For a > 0 let E(a) denote the event 


X(t, o) — X(s, o). E 
| ellit — sli) :$te L,0 « |t- sl] < i] < 2 


Then if o(h)/ (4) tends to zero with h, it follows that, for any a > 0, P{E(a))} = 0 
or 1. 


E(a) — fo: lim sup 
hlo 


Note that since it is precisely events like E(a) that determine the local growth 
of the sample functions, the theorem provides information about the type of 
result we are seeking. Furthermore, there is no loss of generality involved in 
dealing only with real-valued fields, since an (N, d) field is simply made up from d 
real-valued fields, and the zero—one nature of the theorem immediately allows 
us to apply it to this case as well. 


Proof 


From the Karhunen-Loéve expansion of X (Theorem 3.3.2) we have 
(8.3.3) X(t, œ) = Y AI? $0040), 
j=l 
where the sum converges uniformly in t € I, for almost all c». The A; and $; are, 


respectively, the eigenvalues and eigenfunctions of R, and 0,, 0,,. .. isa sequence 
of independent N(0, 1) variates. Now, by (3.3.1) and the homogeneity of R, 


IPn(t) — 6491 = 


À, f [R(t — 1) — R(s — 916.9 dt 


1/2 
< ^ff [R(t — € — R(s — 9p a) 
lo 


by the Cauchy Schwartz inequality and (3.3.2). The last expression is equivalent 
to 
l 1? 
i [ PEND Visa} Nen! del 
of 


A,R'"(0)(t s) 
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by the Cauchy-Schwartz inequality and the homogeneity of X. Thus 


(8.3.4) |6,(t) — ,(8)| € A, R? Oott — s). 
Now let n > 1 and let E,(a) be the event defined by 
E,(a) = 


[oiim wp| Š y” o 6(@):s,te1,,0 < |s — t| € JE d} 


By (8.3.3), E(a) and E (a) are equivalent up to a set of probability zero; i.e. 


P{E(a)} = P{E,(a)} 
But (8.3.4) and the assumption on o(h)/#(h) tell us that 


E,(a) = E,(a), forn = 1,2,3,.... 


Thus E,(a) is a tail event, since it depends only on 0;(c) with j > n. Hence, 
applying the Kolmogorov zero-one law (Lemma 1.4.3), P{E,(a)} = 0 or 1, 
as must P(E(a)), and we are done. 


With the knowledge that the local behaviour of a Gaussian field is determined 
by its incremental variance function we now introduce a special class of such 
fields, characterized by a power-type behaviour for the c;. The remainder of the 
present section as well as a substantial part of the following sections will then be 
devoted to a detailed study of their sample function properties. 


Definition 8.3.1 


Let X(t) be an (N, d) Gaussian field, such that each X; has zero-mean, stationary 
increments, and a continuous covariance function. For each i= 1,...,d set 
o?(t) = E{| Xs + t) — X(s)). Then if for each i = 1,...,d there exists a fi 
such that 


(8.3.5) f; = sup{B: oft) = oCIti^), [it] | 0} = inf {B: |t]? = o(o.t), iiti L0). 
and 0 < f; x 1, we call X an index-B Gaussian field, for B = (B,,.... Ba). 


An important special case of (8.3.5) occurs when 


c(t) — 


8.3.6) Tok C5 
( tigo ltl 


for each i, and 0 < c; < oo. Note that if f; = 0 then X, is a degenerate field. If 
f; = 1 the sample functions of X; do not generally exhibit significant erraticism. 
If f; > 1, X; is not a properly defined process as its ‘covariance function’ is no 
longer positive definite. 
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A specific example of an index-f field is the (N, d) Gaussian field whose 
coordinate fields have the covariance functions 


(83.7) RAs, t) = E(X(S)X(0] = zalt? + Is|?^ — |t — si^. 


This field actually satisfies (8.3.6) with equality for each t. When N =d = 1 
and f, = 3 this is simply the covariance function of a Brownian motion. When 
d = 1 and fj, = 4 the resulting real-valued random field is known as the Lévy 
N-parameter Brownian motion, and was first studied by Lévy (1948). It is also 
known as the isotropic Brownian motion since its probabilistic behaviour is 
unaltered by a rotation of its parameter space A. 

Another example, first given by Berman (1972) for N = d = 1, is the #*- 
valued stationary Gaussian process on 2! whose ith component has spectral 
density function |A|! -?^(1 + |A|?)~ £. In this case (8.3.6) is satisfied with 


(8.3.8) g= { le^ — 1/7 | A778! dA. 
To see this, note that, dropping the subscript i, 
(8.3.9) o7(t) = 2[R(0) — R(t)] 
z oo (1 = e^) A|1 7 2p 
zz | 012 — u 
zl (1 — cos At — isin At)|A| ^ ?? 
5 A 
1 — A? d 
© (1— cos AD)|A|1  ?? 
A 
a 1-2 ad 


where the last equality follows from symmetry considerations. But the last 
expression is no more than 


f [elt — 1121417 2p 
e 14-2 


A simple change of variables from 4 to A/t suffices to establish that (8.3.6) holds 
with the c; of (8.3.8). 

We shall now use (8.3.5) to prove Hólder-type properties for the sample 
functions of index-B fields. For this we need a definition. 


då. 


Definition 8.3.2 


We shall say an (N, d) field satisfies a uniform stochastic Holder condition of 
order a if, with probability one, its sample functions satisfy a usual Holder condition 
of order a, with the exception that the constant A in (82.1) is replaced by an almost 
surely finite, positive. randum variable. 
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We can now place the following simple limits on the erraticism of an index-B 
field. 


Theorem 8.3.2 
Let X be an index-p, (N, d) Gaussian field. Then X satisfies a stochastic Holder 
condition of order a for every a with a; < Pi, i = 1,...,d. Furthermore, if o; > Pi 


for any i, the sample functions fail to satisfy any uniform Hólder condition of 
order a. 


Proof 


Clearly we need only treat one component at a time, so we may drop the 
subscript and assume X is an (N, 1) field with incremental variance function 
c?(t). Let p(u) be the function on #! defined by 


(8.3.10) p(u) = max{o(t): |t] < lul / N). 


Then according to Theorem 3.32 (especially 3.3.23) there exists a finite, positive 
number C, and an almost surely finite, positive random variable B such that, 
with probability one, 


(83.1) ch) = sup{|X() — X(l:s te L, ls — tj < h} 
< Bp(h) + C, Í (—log u)!? dp(u) 
0 
= Bp(h) + C,[(—log u)? p(u)]S 
+ ic. f (toe u) /?u~' plu) du 
0 


where the last line is obtained via an integration by parts. But (8.3.10) and the 
condition on o imply that there exists a finite C, > 0 such that, for any 
Bp >O,¢>0, 


pu)u| 9^9? < C, as u 0. 
Thus, as h > 0, 
[(—log u)'/*p(u)]§ x Cn? *? 


h h 
[ tos u) "u^ !p(u) du < Í u 1*9-*dg = (B — ey Ihe E. 
0 0 


Substituting the above three inequalities into (8.3.11) implies that, with proba- 
bility one, @(h) < AR? ^*? as h — 0, where A is an almost surely finite random 
variable. Thus X satisfies a stochastic Hólder condition of order æ for every 
a < fl, and the first part of the theorem is established. 
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To prove the second part, we again take X to be an (N, 1) field and note that 
it is sufficient to prove that, for example, the ratio [X(t) — X(0)]/|t]^** is 
almost surely unbounded as t — 0. But this 1s easy, since this ratio has a zero- 
mean normal distribution, with a variance of O(||t|| -?*) ast —^ 0, and so becomes 
almost surely unbounded in the limit. This completes the proof. 


In Section 8.8 we shall prove a result much stronger than the mere failure 
of a uniform stochastic Hólder condition on I, when o; > f;. There we shall 
prove, using local time methods, that such a condition fails to hold, with 
probability one, for every t in J, (see especially 8.8.26). 

Results like Theorem 8.3.2 hold for essentially all (N, d) Gaussian fields, not 
just index-B ones. However, as one would expect from Theorem 8.3.1, rather 
than being stated in terms of Hólder conditions the more general results relate 
to the behaviour of | X;(t + s) — X,(s)|/o,(t) as t 0. We shall not treat this 
more general case here, however, but refer the interested reader to Geman 
(1977a and b, 1980), Geman and Zinn (1978), and, particularly, to the extensive 
survey of Geman and Horowitz (1980). 
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We shall now see precisely how well the notion of Hausdorff dimension serves 
as a measure of the amount of erraticism exhibited by index-f Gaussian 
fields. Throughout this section X(t) = (X,(t),..., X,(t)) will denote an (N, d) 
index-B Gaussian field, so that it has zero mean and homogeneous increments. 
To avoid degeneracies, it will be necessary to place some restrictions on the 
type of dependence allowed between the coordinate fields X,,..., Xa. In 
particular, if we write X(t) for the covariance matrix of the vector X(t) — X(0), 
so that the diagonal elements of X(t) are the incremental variance functions 
oz(t) = E{|X,t) — X,(0)|?}, we shall require that there exists an e > 0 such 
that, for all te I* = [— 1, 1]", 


det X(t) 


=r 2 &. 
fay c;(t) 


This condition will be satisfied, for example, if the coordinate fields are 
independent. In this case X(t) is simply a diagonal matrix with diagonal elements 
o?(t) so that (8.4.1) holds with ¢ = 1. In the following section we shall consider 
a specific example in which the coordinate fields are not independent but 
(8.4.1) holds. 

The following result was first proven for N = d = I, $ = 3 (so that X is 
simply a Brownian motion on the line) by Taylor (1955), and Kahane (1968) 
proved a version of it for general d. Yoder (1975) looked at the case of general 
N, d and B (V... 1). Adler (1976b) considered Bo (fh... f) while the 
version given here is dite to Cuzick (1978). 


(8.4.1) 
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Theorem 8.4.1 


Let X be an (N, d) Gaussian field on I, of index f, with coordinates so arranged 
that the B satisfy 


(8.4.2) Ds emere eT. 


If (8.4.1) holds and the ot) are bounded away from zero on I* = [—1, 1]" for t 
bounded away from the origin, then, with probability one, 


(8.4.3) dim(Im X) = minfa aus Li (Ba = el 


(844) — dim(Gr X) = E the Oe Bs Sa P| 


dim(Im X) if dim(Im X) < d, 


il 


N+ x (1—f) X ifdim(Im X) = d. 


This theorem has the following trivial corollary whose content is perhaps more 
transparent. 


Corollary 


Let X be as in the theorem, and suppose that each coordinate field has the same 
incremental variance function, with the same index p. Then, with probability one, 


dim(Im X) = min( 4 3). 


dim(Gr X) = minl 7, N + d(1— p| 


Proof 


That the right-hand sides of (8.4.3) and (8.4.4) serve as almost sure upper 
bounds to dim(Im X) and dim(Gr X), respectively, is an immediate consequence 
of Theorem 8.3.2 and Lemma 82.1. We need only show that they also serve as 
almost sure lower bounds. 

Consider Im X. Then we can differentiate between two cases. Suppose, firstly, 
that the right-hand side of (8.4.3) equals 1, which is its smallest possible value. 
Then we have dim Im X < 1 a.s., and we need only prove the reverse inequality. 
But then d = 1 and/or N = ff, = 1, and the almost sure sample path con- 
tinuity of X automatically yields dim Im X > | with probability one, which 
establishes (8.4.3) in this case. 


8.4 INDEX-D FIELDS AND HAUSDORFF DIMENSION 205 


For the second case, we assume the right-hand side of (8.4.3) is strictly greater 
than 1. Then, according to Lemma 8.24, to prove the result we are seeking it is 
sufficient to show that 


(8.4.5) Í Í E{|IX(t) — X(s)] *) dt ds < œ 


for all 1 «x < min{d, [N + 4, (Ba — B)]/B4). But since the X; have 
stationary increments, we need only establish that 


(8.4.6) Í E([(X,(t) — X,(0) +- + (X,() — X0] dt < o. 
I$ 


Now make the change of variables Y(t) = o; (A(X — X;(0), i = 1,...,d. 
By (8.4.1) we have that the covariance matrix of Y(t) has a determinant of at 
least £, so that its (multivariate Gaussian) density is bounded. Hence it is suf- 
ficient to establish that 


(8.4.7) NNI Tct [y40,(0)]?) *? dy dt < oc. 


By assumption the o;(t) are bounded away from zero for t away from the origin, 
so that the integral over I* need only be considered over I? = [— ô, 9]" for 
ó » 0. Also 


c(t) > K|t||" for |t] < ô for some ô > 0, K > 0, 
with y; > f; for B; < 1 and y; = 1 when £; = 1, and y € y; € Sy <1. 
Thus by (8.4.7) it is sufficient to establish the finiteness of 
(8.4.8) Í iq cm f [vt + Gilt? 77)?  --- + Oalltl ?)*]7*? dy dt. 
I$ E 


Using the fact that [7 (a + y?) ? dy = C(ó)a ?* !? for ô > $, we can integrate 
out y, to obtain that (8.4.8) is less than a constant times 


[icf [Gat]? 7? + ++ + Qulitip Port 9? dy dt. 
HA pd-1i 


If we now integrate out y,,..., yy, in the same fashion, and assume a < d so 
that the remaining integral with respect to y, is finite, we find that (8.4.8) is less 
than a constant times 


(8.4.9) Í it art 2m e* D+ Fam ya- atd- D dt, 
i 


Since the exponent telescopes to 
d 1l d 
J hu | (d ] Wa Y Y ( vd i) tae 


© rod 
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(8.4.9) will be finite if a < (N + $=; (ya — y))/ya. Letting y; | f; gives the 
required result for Im X. 

To find dim(Gr X) we first consider the case dim(Im X) « d. Since dim(Gr X) 
> dim(Im X), it follows from Theorem 8.3.2 and Lemma 8.2.1 that 


min = + x (a — Bi) N+ X  -— pa| > dim(Gr X) 


> dim(Im X) 


ON E YLa Bi - B) 
Ba 


with probability one, which implies the required result. 
When dim(Im X) = d the dimension of the graph can be larger. Then using 
Lemma 8.2.4 and carrying out similar manipulations as above on the integral 


Í Í E([IX(t) — X)? + It — si?] ^75 ds dt 


will verify that dim(Gr X) > N + 7. , (1 — fj), and so complete the proof of 
the theorem. 


We now turn our attention to the random sets of primary concern to us, 
the excursion sets of X, or, rather, the level sets which bound them. It is actually 
somewhat more difficult to obtain dimensional information for these sets than 
for the image and graph of X, and what information can be obtained is not as 
complete. However, we shall start with the following result, in which we write 
Cov(Y) to denote the covariance matrix of a random vector Y, while X^ !(u) 
denotes the usual level set ft € J,: X(t) = u}. We shall discuss the content of this 
theorem following its proof. 


Theorem 8.4.2 


Let X be as in Theorem 8.4.1, with the added assumption that X is homogeneous 
and with (8.4.1) replaced by 


det Cov(X(t), X(0)) 


8.4.10 
) i-i o; (t E 
T hen, for almost every ue R’, 
d 
(84.11) dim(X^ (u)) = max. N- Y n) 
zt 


with positive probability. 
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Proof 


We prove (8.4.11) in two stages. One is easy. Since X is an index-f field, its 
sample functions satisfy a uniform Hólder condition of order @ for every & 
with a; < fj, i = 1,...,d (Theorem 8.3.2). Thus Lemma 8.2.2 tells us that, with 
probability one, 


d d 
dimX-'w)<N- YB, iN-» fj0 
i21 i21 
for almost every u e 2°. Thus we have half proven the theorem. To complete 
the proof we must show that the reverse inequality holds with positive proba- 
bility. 

According to Lemma 8.2.3, in order to do this it is sufficient to exhibit a 
countably additive, finite measure, u say, supported by X^ !(u) and for which the 
energy integral 


(84.12) LU) = Í I5 = tl" dats) duo 
X-!(p? 


is finite whenever « x N — Vis | Bi. That is, u has finite « energy. Since X^ !(u) 
varies with each œ, we shall actually need to construct a family, u(., ©), of random 
measures satisfying (8.4.12) for almost all c». We shall only be able to show, 
however, that the measures we construct are strictly positive on an c set of 
positive probability. Since the construction proceeds via a limiting argument, 
we must take a moment off to determine some properties of measures of finite 
f energy. 

Let Mp denote the space of all measures on 4?" of finite f energy. Then it is 
possible to equip .@, with an inner product 


- c 
wx) = [ax vitai ave) 


with which it becomes a Hilbert space. However, more than this is true, for if 
we write Mg for the subset of non-negative measures in Mg, then Mg isa 
complete metric space with the induced metric ||- ||; given by 


(84.13) luli =) = ie IN: — yl" du(x) du(y). 


This means, as usual, that if 4,, 42... is a Cauchy sequence in Mg —ie. if 
liy, — Hnila > 0 as m, n — oo - then there exists a limit measure p € Mg such 
that |j — lle > Oasn — o. The completeness of Aj is neither a trivial result 
to establish nor i$ it obvious, (For example, the larger space . A, is not complete 
if ff > 1.) We shall thus use it here without a proof, an example of which can be 
found in Landkof (1972, p. 90). 
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We can now proceed with the construction of our measure on X^ !(u). To 
simplify the notation, let us assume for the moment that u — 0 and define a 
sequence of random measures y, on the Borel sets of A by 


(8.4.14) M(B, œ) = P(t, c») dt, 


TgaB 


where, for each n > 1, ¢,(t, œ) is the density 


2 
(8.4.15) Palt, ©) = Qnny? exp| - eor | 
We shall show that, with probability one, the 4, converge in the metric ||- ||, 
to a measure u € M3 supported on X^ !(0) whenever a < N — Y2., fli. 

For each fixed t note that $,(t, œ) is, except for a constant factor, simply the 
value of the density of d independent, N(0, n^!) variates at the point X(t). 
(As n — oo it approaches, in an undefined sense, the Dirac delta function on Z4. 
It is essentially no more than a close relative of the ó, function used to count the 
zeros of functions in Theorem 5.1.1. In fact, we could replace $, by such a 
function, with support in the sphere c(n^ +), without altering the forthcoming 
analysis in any significant manner. Such a replacement would, however, 
complicate the algebra.) 

Using the form of the multivariate Gaussian characteristic function we can 
rewrite $,(t) as follows: 


(8.4.16) p(t = Í, exp| - “ot LETY xo] du. 


We claim firstly that u, € 4/7; for each a < N — )4_, B;. Since it is clear that 
H,(B) > 0 for every n and B, we need only show that, with probability one, 
lanla < oo if «< N — Yi-if;, which will follow if Ef|u,l2) < co. But, 
applying (8.4.16), 


(&417) Efm l2} = Í Í Eld,(t)o,(s)}|)t — sj "* dt ds 


= SS Sadel- D ED etespti - X(t) 


+ H- X(s)]}\|t — s|| * du dá dt ds 


< f LL renti - X(t) 


t ü-X(s)])t — si|^* du dû dt ds. 
Consider the expectation here. We can write it as 


(84.18)  Elexp[i(u- X(t) + à- X(s))]] = Elexp[i(u, à) -(X(t), X(s))]) 
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which, by (1.6.4), is simply 
exp[ — &(u, à)Cov(X(0, X(s))(u, 6)" ], 


using the notation described prior to the statement of the theorem. Integrating 
this expression over u and @ gives 


(2n)*/? |det Cow(X(t), X(s))|- !? 


and this, under the conditions of the theorem, is bounded by C | [7. , e; (t — s) 
for some finite C. Substituting this back into (8.4.17) yields 


d 
Etui < Cf | ie sh- Hor € ~ 9 dt ds, 
o* Io i-1 
which will be finite if 
d 
iti" Tec SQ at < o. 
I$ i=1 
Converting the integral to polar coordinates shows that this will be the case if 
1 d 
Í r^ !-7*[[o?!() dr < oo, 
0 i=1 
and this will occur if N — 1 — « — 92., f; > —1, on using the fact that X is an 
index-B field. That is, if & < N — »2., fj; then each y,(-, œ) is, on an o set of 
probability one, an element of M}. 


We shall now show that a subsequence of {,(-, @)} is Cauchy in .@; for 
almost every c. But this is easy, since 


(419) Ells = tml?) = | | ETO - AAO 


— Pmt) |t — si|” ds dt 


= Í Í I | E{exp[i (û - X(s) 
oY Io Riv AI 
+ u- X(0)]J|t — sj ~ daml, 6) du dû ds dt 
j ill? 
d, (u, à) = [ef aa) ew a J 
7 |[(- b E a- | 
n m 


(uu) 0 


where 


Since 
lim. lim d,,, 


"nc D 
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it is clear, from dominated convergence, that (8.4.19) will tend to zero as n, 
m — oo if 
Í Í Í f E{exp[i(u - X(s) + à- X(0)]Mt — si| ^ * du dá ds dt < oc. 
o o Riv Ra 
But this is simply the final expression of (8.4.17), which we have already shown 
to be finite when « < N — Y2., fj. Thus, under this condition, 


lim lim Elus = Mmll2} = 0. 


noo m-*oo 


Now choose a subsequence n, tending to infinity so that 


Ellu,, = Pacis 2} < 25 


Applying the Cauchy-Schwartz inequality and summing yields 


H pi UN Pe hes] < X (VD « oo 
k=1 k=1 
so that 


De 
> We — Ansila < © a.s. 
k=1 


and (j4,) is almost surely Cauchy in Z7 . Denote the limit (random) measure by 
HC, œ). Then since the densities $, tend to zero outside any neighbourhood of 
X- (0), i.e. 


lim | ¢,(t)g(t) dt = 0 


no VI, 


for any continuous function g that vanishes in a neighbourhood of X ^ !(0), it is 
straightforward to see that p(-, œ) has its support in X^ (0) with probability one, 

The last point to prove is that u is non-zero with positive probability, For 
each n > 1 we have 


Elu) = Í E{o,(t)} dt 


E f 2x(- x E{exp[iu  X()]) dt du 


= Í epl- H +u Cove | du 
EL n 


using (1.6.4) and the homogeneity of X. On integrating out u we find 
(8.4.20) Eiu,)) = Qn)" tdet[n. + Cow(X(0)]; !? 
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where I represents the identity matrix. If we set 
K = Qn)? [det CovX(0))]- !? 
then, since K is bounded away from zero, we can, in view of (8.4.20), write 
E{u,(I,)} =K + 6,» 0 


where £, > 0 as n > oo. Furthermore, as in (8.4.17), 


E(In 0313 x Í f Lf Eiet - X(t) + à - X(s))]} du dû dt ds 


which, if a < N — Èf =, i» is finite. Thus, by the inequality (1.3.3) there exists 
a strictly positive p satisfying 


Píu() > 3} > p 


and independent of n, as long as n is large enough. That is, on an w set of proba- 
bility at least p, u,(1,, c) > 3 for sufficiently large n. Thus the limit measure 
WI., ©) is at least 4 for an œ set of the same probability, and the proof is complete 
for the case u = 0. 

To treat general levels u, it is sufficient to replace the density , of (8.4.15) 
with 


$,t, o) = (2mnyexp| aX. o) — 4 


and otherwise proceed as above. This establishes the theorem in general. 


Let us now pause to consider the content and implications of the result we 
have just established. Firstly, it is important to note that not only do we not have 
a result that has been proven to be true for all levels u, but when the result does 
hold, i.e. dim X^ (u) = N — $2. , fj, it holds, with positive probability, only 
for one level at a time. A more powerful result would be of the form 


d 
(8.4.21) Plo almost all ue 2^, dim X^ ((u) = N — Y 2 > 0. 
i=l 


We shall call a result of this form a uniform dimension theorem, and, later in the 
chapter, we shall indicate how one might go about proving this stronger result. 

The second point that is glaringly obvious is that unlike the almost sure 
results of Theorem 8.4.1 the conclusion of Theorem 8.4.2 holds only with 
positive probability. In fact, it is casy to see that this probability is strictly less 
than one, for if a point u lies outside the image of X, the level set X^ !(u) must be 
empty, and so have dimension zero. Thus, for a given u, 


l | 


d 
pldim X ' N Y B. l - PINO an some tc h! 
4 I 
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and the rightmost probability is generally less than one. To raise the status of the 
result to an almost sure one, we could consider events such as 


(8.4.22) faim X-'(u) = N — Y f; for all u e Im x. 


i=1 


We shall also take up this point later in the chapter. Another alternative, 
however, would be to consider the level set of X over a larger domain than just 
the unit cube, and consider, say, the level set 


(8.4.23) L,(T) = (te 4*:0 < t; < T; X(t) = u}. 


As T > œ, it becomes more and more likely that L,(T) is not empty for any u, 
so that we might expect that dim L,(T) eventually becomes N — 32. , fj; with 
probability one. That this is actually the case is verified by the following theorem. 


Theorem 8.4.3 


Let X be as in Theorem 8.4.2 and suppose, furthermore, that X is ergodic. Then 
with L,(T) as in (8.4.23) the following holds for almost every u with probability 
one: 

d 
(8.4.24) lim dim L;(u) = N — Y fj. 
i=1 


To m 


Proof 
Let I, denote the cube {te 4*":k, — 1 < t; < kọ i= L..., N} and set 


IT] IT] 
SE 9 TE U Ty. 
ki=1 knw=1 
Then by Theorem 8.4.2 there exists a ó € (0, 1] such that for each k with integer 
components 


P{dim(t € Ix: X(t) = u) = f*} = ô, 


where f* = N — Y1., fi. 
Now let Y, be the indicator variable for the event dim(t € Ix: X(t) = u) = fi*. 
Since X is ergodic it follows that 


[7] I7] 
T~} - } XoE(YX)-2óas  asT o. 


ki=1 kn=1 


But this means that, with probability one, for large cnough T at least one of the 
sets {te fy: X(t) = u}, 1 < k; € T, must have dimension f*. Since for any sets 
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A, B it is always true that dim(A o B) > max(dim A, dim B) it thus follows 
from Theorem 8.3.2 and Lemma 8.2.2 (extended to cover the case when the 
domain of X is S, rather than Z) that dim(t € Sy: X(t) = u) = f* almost surely 
for large enough T. This proves (8.4.24) and hence the theorem. 


Finally, let us consider the condition (8.4.10) of Theorem 8.4.2, which re- 
places (8.4.1), the condition we had assumed earlier in the section. Both of these 
conditions were intended to limit the degree of dependence among the com- 
ponent processes of X. To display the relationship between these conditions, we 
shall prove the following lemma. 


Lemma 8.4.1 
Condition (8.4.10) implies (8.4.1). That is, if t € Iy then 
det Cov(X(t), X(0)) det Cov(X(t) — X(0)) x 
Lio ^^^  qKaneo C 


for some £* > 0. Furthermore, both conditions are satisfied when the coordinate 
fields are independent. 


(8.4.25) 


Proof 
We start by defining two d x d matrices, V and R, by 


V = (v) -(CE(X(0X(0)), R = (ry) = (E(X(0X (0)). 


Then it is straightforward to check that the following two equalities hold: 


Cov(X(t), X(0)) = (k v) 


co«xto - xw xay - (^ - 9 BLY) 


(The matrix 2(V — R) is, of course, simply X = Cov(X(t) — X(0))) 
Furthermore, denoting again thed x d unit matrix by I, it is easy to check that 


(le v» dy v) 


(8.4.26) det Cov(X(t), X(0) det Cov((XN()— N(0)), X(0)). 


so that 
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Now suppose the component fields X,,..., X; are independent. Then by 
the above the numerator of (8.4.25) reduces to 


eit) i dei 


jo; (t) Vaa 


where v; = E{X?(t)}. But this is simply 
d d 
WOL os — 4070]. 
i=1 i=1 


The factors in the second product are all strictly positive, so that we have that 
this ratio is at least c*, where e* satisfies 


d 
0«zc* x inf [I [vi — 40; (t)]. 


teš i=1 


That is, (8.4.10) is automatically satisfied if the coordinate fields are independent. 
We now prove the implication in (8.4.25). But this is easy since from (8.4.26) 
we have that 


XV-R) V-R 
det Cov(X(t), X(0)) — 
et Cov(X(t), X(0)) pe ` | 
| XV - R) 0 
I V-R XV-«R) 
Since det(5(V + R)) can be neither infinite nor zero, we immediately obtain 
the implication, and so the lemma. 


= det X - det(4(V + R). 


Unfortunately, not only is (8.4.10) a stronger condition than (8.4.1) but it is 
sufficiently stronger to substantially weaken the effectiveness of Theorem 
8.4.2. For example, in the following section, we shall want to study the level sets 
ofa particular field whose covariance satisfies (8.4.1) but not (8.4.10). Fortunately, 
however, a better version of Theorem 8.4.2 exists. Cuzick (1980) has developed 
a different approach to the capacity method we have used to obtain a lower 
bound on the dimension of X^ !(u), as well as relaxing the ‘almost every w part 
of the result. His technique is based on a representation of X similar in spirit, but 
finer in detail, to the Karhunen- Loéve expansion of Theorem 3.3.2 (see Klein, 
1976, and Cuzick, 1978, for details). This technique is powerful enough not only 
to obtain a stronger result but also to replace (8.4.10) in Theorem 842 with 
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(8.4.1). We shall not reproduce his argument here, but shall quote, and soon use, 
his result. This is as follows. 


Theorem 8.4.4 


Let X be as in Theorem 8.4.1, with the added assumption that it is homogeneous. 
Then, for every u € d, 


(8.4.27) dim X^ '(u) = mo EE b n) 


Hi 


with positive probability. 


If we add the assumption that X is ergodic, a corresponding version of 
Theorem 8.4.3 holds as well. The proof is just as before. 

Before we leave the theory of these results, it is worthwhile recounting their 
long and interesting history. Theorem 8.4.2 was first proven for N —d—1 
and f = 4 (Brownian motion on the line) in a foundation-laying paper by 
Taylor (1955). Orey (1970), Berman (19702), Marcus (1976), and Hawkes (1976) 
all looked at various versions of Theorem 8.4.2 when N = d = 1 but f was 
arbitrary. Kahane (1968) treated thecase N = 1,d > Lbutf, = fl; =--- = ff, 
while Adler (1977b) let N also be greater than 1. Both of these latter workers, 
however, assumed that the coordinate fields X,,..., X, were independent. 
Cuzick (1978) lifted the independence assumption, introduced conditions (8.4.1) 
and (8.4.10) and stated our Theorem 84.2. As we have just noted, Theorem 8.4.4 
is due to Cuzick (1980). The technique used to prove Theorem 8.4.3 seems to have 
been first used by Orey (1970) for N = d = 1, and then for the more general case 
by Adler (1977b). 


85 AN EXAMPLE RELATED TO SUITABLE REGULARITY 


The primary aim of Chapters 4 to 7 was to develop, for random fields, a theory 
analogous to the level crossing theory of stochastic processes on the real line. 
Throughout that development, the regularity conditions we were forced to place 
on the sample functions of our fields were somewhat more stringent than those 
required in one dimension. For example, Theorem 4.1.1, which gives the mean 
number of level crossings of a stationary Gaussian process on the real line, 
requires only that the sample functions be almost surely continuous to hold. 
The resultant expectation is finite if the process possesses a mean square 
derivative with finite variance, and so, a fortiori, if the sample functions possess 
continuous derivatives with probability one. Theorem 5.3.1, however, which 
gives the mean DT characteristic of the excursion sets of a Gaussian field, 
requires the almost sure existence of sample functions which possess continuous 
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partial derivatives of up to second order. That is, we require an extra set of 
derivatives in the N-dimensional (N > 2) setting. 

There were two main reasons for requiring the existence of these second-order 
derivatives, most easily described in the two-dimensional setting. Here we were 
counting the number of points t* € I, for which 


_ ox (t*) _ 


(8.5.1) X(t*) —u X, 


0 


and thecontour line ft: X(t) = u} was either convex or concave in the neighbour- 
hood of such a t*. The first reason, then, for assuming second-order derivatives 
was to distinguish between convexity and concavity in a simple fashion. One 
could hope that with a more careful, although probably more complex, analysis 
it would be possible to dispense with second-order derivatives and still manage 
this discrimination. 

The second reason for this assumption was to prove, along with other 
restrictions, that the number of points t* € J, satisfying (8.5.1) was finite with 
probability one (cf. Lemma 4.3.3). What we shall now show, by means of a 
specitic example, is that if we fail to insist that X possesses second-order partial 
derivatives we can have an infinite number of t* satisfying (8.5.1). Indeed, the 
solutions of this equation turn out with probability to have positive dimension, 
so that they are even uncountable. This being so, the very definition of the DT 
and IG characteristics of excursion sets becomes meaningless. 

In particular, we shall now prove the following theorem. 


Theorem 8.5.1 


Let X(t) be a homogeneous, isotropic, real-valued field on the plane. Suppose 
X(t) = 0X(t)/Ot, exists and 


(8.5.2) ao7(t) = E{|X,(t) — X,0)/j 
is such that 


0 
(8.5.3) lim $165 O _ 


m. C for some 0 < C < o» and 0 < f < 1. 
t110 1 


T hen if o (t) is bounded away from zero on [ — 1, 1]N for t bounded away from the 
origin, the dimension of the set of t* € I, satisfying (8.5.1) is 1 — P with positive 
probability for every u. 


Proof 


The proof of this result rests on defining the (2, 2) field X = (X, X4) and 
applying Theorem 8,42 to it. To do this, we note firstly that X is an index-(1, f) 
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field. That X, has index f follows from (8.5.3) and the fact that isotropy implies 
a(t) = a,(\\t}, 0) for every t. That X itself has index 1 can be seen by noting the 
existence of 0X/ét, and isotropy implies that the covariance function of X has a 
Taylor expansion, at the origin, of order at least two. However, the term in 
|t]! must disappear, or else X would have index 3 and so, by Theorem 8.3.2, 
could not possess sample function derivatives everywhere. Thus the term in ||t|? 
is the dominant one, so that X has index 1. 

Thus the conclusion of the theorem will be an immediate consequence of 
Theorem 8.4.2 if we can show that condition (8.4.1) holds; i.e there exists an 
e > 0 such that, for all te [— 1, 1]", 


det X(t) 
(8.5.4) MEE = & 
where X(t) is simply the 2 x 2 covariance matrix of (X(t) — X(0), X,(t) — 
X ,(0)). That this is true follows from Lemma 8.5.2 below, which we state and 
prove following the preliminary Lemma 8.5.1. 

We start by noting that since X is isotropic Theorem 2.5.3 yields the repre- 
sentation 


R(t) = (^ At aca) 
0 


for its covariance function, where Jo is a Bessel function and G is bounded and 
non-decreasing. Furthermore, if G, G,,G,,...are functions with these properties, 
we say the G, converge weakly to G, written G, > G, ordG, > dG, if G(x) > G(x) 
as n > co at every continuity point x of G. 


Lemma 8.5.1 


Under the conditions of Theorem 8.5.1, as t > 0, 


dG(ltl ^ 4 p- z 
(8.5.5) ddjexo > Kg 1($)A 3*2 dA 
where 
p= (5) 
ti 
and 


ut 2n 
Kah) - 5 [ [ cos?(0 + Pf - cos(A cos NJA [128 dO dA. 


Foro- Ro T, Kus bounded away from zero and infinity for all i. 
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Proof 


Let F(A) be the spectral distribution function in the usual representation of 
R(t), so that G(A) = fian dF) (cf. the proof of Theorem 2.5.3). Let t* = 
(tı, 0). Then (8.5.3) implies that, for any real s, 


t* 
a s 


Also, as in (8.3.9), it is easy to see that since the covariance function of X, has the 
spectral representation f 42 exp(it - 2) dF(A) we have 


cit*) | Xu dF(X) 
ci(t*) EL Rue: cit*)' 


If we now transform to polar coordinates and then the substitution |jA|| > 4/t,, 
this equals 


(8.5.6) as t* > 0. 


dG(A/ti) 


P) a?(t*) dé. 


1 2n po 
— | Í A? cos? 6[1 — cos(sd cos 6)] => 
2m Jo Jo 

However, from the definition of K,($) we have 

1 2n (oo 
K,(0)5?7 = A f | 4? cos? 0[1 — cos(så cos 0)]A- 9 * ? dA dé. 
0 0 
Thus (8.5.6) implies 


2m (o5 
f f A? cos? 0[1 — cos(sd cos 8)] 
0 0 


Ee 1768+29 dy 


dð > 0 as t* > 0, 
t,o7(t*) K,(0) | 


which, in turn, implies that the inner integral over A also tends to zero as t* — 0 
for almost all 6. However, since s is arbitrary, this and an extension of the 
continuity theorem (Lemma 1.4.1g) imply that 


dG(A/t,) a 2478+29 
> 
tici(t*) K0) 


as t* > 0, 


which is almost (8.5.5). To establish the lemma for general t write 


dG(jt| ) AGAI eit. 0) 
Itl ai eedt 0) AW 
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and check that 


ci(t) 


2n moo 
oX(Iti, 0) = x Í Í A? cos? 6[1 — cos(Acos(0 — $))] 
1 , 0 0 


dG(A|t| " ) KQ) 
x > as t 0. 

|t] ^c3CIt]], 0) K,(0) 

Lemma 8.5.2 
Under the conditions of Theorem 8.5.1 we have 
. . „ det X(t) . det X(t) 

8.5.7 0 < lim inf ——75;—— < lim sup ——5—— . 
G3 METTE TC S a. Te 


Proof 


The upper bound in (8.5.7) is obvious since det X(t) is less than the product 
of the diagonal entries of X, and these behave like the denominators of (8.5.7) 
near the origin. The lower bound will follow if we can show that 


X(0-X0). X, - X? 
ile i te a | 


implies that a — b — 0. The expectation can, however, be written in terms of the 
spectral distribution function as 


æ (e^t 1) (e^t _ 1) 
LL ÀA, = 
l, TRAP 


On making the substitution 4 > A||t|| " ! and setting t = tj[t|| !, this equals 


(8.5.8) lim inf E 


120 


2 


a dG(). 


dG(IA] - lit] 77) 
UKKO 


Í la(e^* — 1)o,(t) + bA,(e** — 1)|? 
0 


Now let t > 0 in such a way that t|t| ! tends to a limit point, t* say, some- 
where on the unit circle. Then as t —^ 0 we have t > t*. Then applying the 
previous lemma it is not hard to see that if (8.5.8) holds we must have 
bA,(e* — 1) = 0 so that b = 0, from which it must follow that a = 0. But 
this is what we wanted to show, so we are done. 


In conclusion, we note that Cuzick (1978, 1980) originally used the above 
techniques to study the set of eritical points of an N-dimensional random field 
at a fixed level He actually replaces (8.5.3) with the weaker requirement that 
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(when N = 2)c(t, 0) be regularly varying at the origin, as, in fact, could be done 
here. He shows that, under the conditions of Theorem 8.5.1, for every u the set 


(8.5.9) (te I: X(t) — u = 0X(t/0t, = --- = 0X(O/0t, = 0} 


has the dimension N(1 — f) — 1 with positive probability. If this is positive, i.e. 
if f «1—N'!, then X violates yet another of the conditions of suitable 
regularity; viz. (3.1.2), which required that the set (8.5.9) be finite with probability 
one. 


86 LOCAL TIME 


We noted, in Section 84, that one of the unsatisfactory aspects of our result 
on dim X^ !(u) was that we could only manage to prove that it held with positive 
probability, rather than with probability one. Furthermore, the result held for 
each level u separately, and there was no probabilistic uniformity in u. To 
overcome these problems it is necessary to undertake a far more careful analysis 
than that we have so far been engaged in. In particular, the measure u introduced 
in the proof of Theorem 8.4.2 needs to be very carefully studied. A priori this 
is not surprising. After all, uis a measure supported on the set of interest, X^ !(u), 
and the properties of u as a measure on the whole of J, should contain a great 
deal of information about its support. Let us commence, therefore, by looking 
a little more closely at this measure, and provide it with a heuristic, but enlighten- 
ing, description. 

Firstly, recall that there was not actually one measure y, but rather we were 
dealing with a family of such measures, each supported on a different level set. 
Thus, if we write 


(&61) plu, B) = lim (=) e|- "xou dt, 


no 2 


we formally acknowledge this by the introduction of the second parameter, u. 
(Note that this is not quite the same as the measure u of Section 8.4, but rather 
is a multiple of (2x) ^? of that measure. Nevertheless, constructing u as in (8.6.1) 
would have sufficed for the purposes of Section 8.4, so we shall use this definition 
of it here.) For the moment, let us drop the usual demands of rigour and allow 
ourselves the freedom of interchanging limits and integrals without full justifica- 
tion. Then, if A is a Borel set of Z4, we have 


f pu, B) du = [iim ICM exp| - EM au} dt. 


For fixed t, the inner integral is simply the integral, over A, of the density of d 
independent Gaussian variates with the vector mean X(t) and identical variances 
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n^ !. Thus, as n > œ, this integral will converge to 1 if X(t) e A and zero other- 
wise. Hence, writing I; for the indicator function of A, we have 


(8.6.2) I uu, B) du = | 1,(X(t)) dt. 


But the right-hand side of this equation readily admits a simple interpretation. 
It is no more than the amount (in terms of N-dimensional Lebesgue measure) 
of the set B over which the field X(t) takes a value in the set A. For fixed B this is 
obviously a measure on # and (8.6.2) tells us that u(u, B) is no more than the 
density of this measure. That is, u(u, B) can be interpreted as the amount of the 
set B over which X(t) actually takes the value u. 

This description of u ties in with what we already know about it. For example, 
if X(t) = u for no te B, our description implies that u(u, B) must be zero. But 
this is equivalent to saying that „(u, .) is supported on X^ !(u), which we already 
know to be true. 

Since u obviously contains a considerable amount of information about the 
locations of values of X, it is known as its local time. Its integral, f , u(u, B) du, is 
called the occupation measure of X, since this describes the amount of B over 
which A is occupied by X. 

To return now from heuristics to a complete and rigorous examination of the 
local time of an (N, d) field is, unfortunately, a rather difficult step to take, and 
would involve far more time than we are prepared to devote to it here. Further- 
more, and perhaps more importantly, the rigorous mathematics of local time 
requires a level of mathematical sophistication substantially more advanced 
than that employed elsewhere throughout this book. However, since local time 
is an extremely important concept not only in dimension theory but also in the 
general sample function analysis of Gaussian processes, we shall devote this and 
the following two sections to a description of how this concept can be utilized, 
omitting difficult proofs and avoiding difficulties in those proofs we do give. 

In essence, we shall now present a somewhat simplified and substantially 
abbreviated version of a wide-ranging review of local time presented in Geman 
and Horowitz (1980). Because of the existence of this survey, in which the his- 
torical development of local time methodology is carefully chronicled, we shall 
not bother to assign credits for the various results and techniques presented, 
other than to note here the following. The notion of local time for stochastic 
processes was first introduced by Lévy (1948, Sec. 50) under the name ‘mesure du 
voisinage. He was interested primarily in Brownian motion sample paths. 
Blumenthal and Getoor (1964) introduced this concept for general Markov 
processes, while, in a significant contribution, Berman (19694) brought it into 


cerned, other relevant papers are by Berman (1969b, 19703, 1970b, 1972, 1973), 
Geman (1976, 19774, 197 7b), Cie man and Horowitz(197 3 1976), Pitt (1978), and 
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Adler (1978c), as well as the book by Saks (1937). A much fuller bibliography of 
local time is given in Geman and Horowitz (1980). 

We start by ignoring what we already know about the random measure yp 
and commence anew, initially by considering the notion of local time for non- 
random functions. Thus, we take a non-random continuous function F(t), 
defined on the unit cube in 2" and taking its values in 2°. Taking Borel sets 
A c R’ and B c 4", we can define the occupation measure of F, analogously 
to (8.6.2), by 


vA, B) = Í 1 (F) dt 


= Ay(A ^ F` (B). 


This measure is well defined for every continuous F and, as we have already noted, 
describes the amount of B that F spends in A. 

For some functions F the corresponding occupation measure will, for fixed B, 
be absolutely continuous with respect to Lebesgue measure in 4^: i.e. there will 
exist a real-valued function p(u, B) on 4 satisfying 


(8.6.3) WA, B) = Í u(u, B) du 


for all Borel A. This immediately leads us to an important definition. 


Definition 8.6.1 


A function F for which (8.6.3) holds for every Borel B c I, and A c H is said 
to be LT, (local time) and the function u(u, B) is called a local time of F. 


The first thing to note about a LT function is that (8.6.3) does not define a 
unique local time, since any function yp satisfying (8.6.3) can be changed, for 
each B, on a u set of A, measure zero, and still satisfy this defining equation. One 
way out of this problem is to work with equivalent versions of u. Since we wish 
to avoid the measure-theoretic complications this gives rise to, we shall assume 
that our local times are sufficiently smooth for the versions we start with to be 
good ones. To formalize this smoothness we write the local time y(u, B) in the 
distribution function form 


N 
(8.6.4) > plu, t) = s» Į TTo, 2! 
i-1 
so that pis simply a real-valued function on 47 x I,. We shall say y is jointly 
continuous in both its parameters if, in this form, p is continuous on # x I,. 
With this assumption we can show that the local time behaves as we want it to, 
at least as far as its support is concerned. For brevity, let L4 denote the level set 
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F^ !(u), and L$ its complement in J,. Furthermore, let # denote the family 
of rational intervals in I,; i.e. sets of the form [ [|]. , I;, where each I; c [0, 1] 
is an (open or closed) interval with rational endpoints. Then, writing F(I) for the 
image, or range of F as t varies over I, and denoting closure with a bar, we can 
prove the following theorem. 


Theorem 8.6.1 


If F is LT and possesses a local time u(u, t) which is jointly continuous then 


(8.6.5) uu, I) = 0 ifu ¢ F(D), I e Z. 
(8.6.6) u(u, LE) = 0 for every u. 
Proof 


We must start by showing that, for each u, u(u, -) is a well-defined, countably 
additive measure on the Borel sets of I,. To do this, note that if s, t € I,, and 
si < lj, i = L..., N, then the defining relation of local time, (8.6.3), implies that 
for almost every u, u(u, s) < u(u, t), writing p in the distribution function form. 
Since p is continuous in u, this holds for all u. This immediately implies, via 
standard arguments, that in its measure form y(u, -) is a countably additive, 
finite, measure on I, for each u. 

We now claim that for each 1 € # and n > 1 the set 


(8.6.7) {u € 2°: u(u, I) > n^! and u ¢ FD} 


has A, measure zero. Indeed, if this were not the case, F(t) would lie outside of 
F(1) for a subset of I of positive measure, which is clearly self-contradictory. 
Since # is countable, the union of the sets (8.6.7) over all n > 1 and I € 3€ also 
has zero measure. This implies that, for every I € #, u(u, I) = Ofor almost all u 
outside F(T). However, since y is continuous it must vanish everywhere outside 
of F(J), and (8.6.5) is established. 

To prove (8.6.6), we start by noting that, for each u, 


(8.6.8) LE = Ü (te I,: |F(t) — uj > n^ !). 


Each set in this union can be obtained as the countable union of open intervals in 
X . By continuity F(t) is bounded away from u on each of these intervals, and so 
by (8.6.5) the local time at u on cach interval must vanish. But since by the first 
part of the proof gu is a countably additive measure on A, it must also vanish on 
cach of the sets in the unton (8.6.8) and thus, ultimately, also on £4. This proves 
(8.6.6) and the theorem 
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Another interesting and informative result on local times is the following, 
whose proof follows standard approximation procedures. 


Lemma 8.6.1 


If F is LT with a local time p and g is any real- or complex-valued Lebesgue 
measurable function, then, for any Borel B c A”, 


(8.69) Í JEFO) dt = f owa, B) du, 


in the sense that if one of the two integrals exists then both do and the two are 
equal. 


In fact, we do not need F to be LT for a result like the above to hold, for if v 
is the occupation measure ofany continuous F, and g is as above, it is always true 
that 


(8.6.10) Í g(F(t)) dt = Í g(u) dv(u, B). 
B Ra 


This equality is important when seeking simple necessary and sufficient 
conditions for a function to be LT. There are essentially two different approaches 
to determining whether a function is LT or not: one relies on Fourier theory, 
while the other rests on the actual construction of a local time. Since we have 
already performed one construction in Section 8.4, and shall undertake yet 
another later on, we shall take the Fourier path now. To do this we require 
the following lemma, which contains standard Fourier results. A proof of the 
first part can be found, for example, in Berman (1969a), while Cramér (1946) 
gives a proof of the second part. 


Lemma 8.6.2 

(a) Let G be a distribution function on # with a square integrable character- 
istic function G. Then G has a density g which is also square integrable. 

(b) If, furthermore, G is integrable, then the density g is continuous. 


Recall that a function H is said to be square integrable (integrable) if 


Í |H(u)|? du < oo (Í | H(u)| du « 2! 
Ra Ra 


With this lemma we can prove the following theorem. 
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Theorem 8.6.2 ! 


Let F: A“ — # be continuous. Then F has a local time u(u, B) square in 
tegrable in u for every B if and only if 


2 
(8.6.11) Í Í exp[i@ - F(t)] at dð < oo. 
ga | JI, 
Furthermore, if 
(8.6.12) Í Í exp[i0 - F(t)] dt |ee < 00 
$4 | JI, 


then F has a local time continuous in u for each B. 


Proof 


Fix B c I, and write the occupation measure v in distribution function form 
viz. vu) = v([ [L. | (— œ, u;], B). We must show that v(u) has a square integrab} 
density if and only if (8.6.11) holds. The characteristic function of v is simply 


$(0) — ND dv(u) 


= f expe - F(t)] dt 


on applying (8.6.10). But if (8.6.11) holds we now have that 6 is squareintegrabk 
so that by Lemma 8.6.2(a) v has a square integrable density. Since this is a loca 
time for F the ‘if’ part of the first half of the theorem is proven. 

To prove the converse, we need consider only B = I, and suppose F has; 
square integrable local time y(u, I,). Since this is a density it has a Fourie 
transform (characteristic function) f(®, I,) and Parseval’s relation implies 


2 > Í |i(u, I,)|? du = Í |/0, 1,)|? de 
Ra Ri 


= 
al 


applying Lemma 8.6.1 to obtain the final equality. But this is (8.6.1 1)so we hay 
finished with half the theorem. 


2 
dé 


Í exp(i0 - u)u(u, I) du 
Rea 


2 
d9, 


f exp[i0 - F(t)] dt 


o 
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That (8.6.12) implies the existence of a continuous local time is similarly a 
consequence of Lemmas 8.6.1 and 8.6.2(b). This completes the proof. 


Let us now use this theorem to investigate some simple functions. The simplest 
of all functions is, of course, the constant function F(t) = c, t € [0, 1]. For this 


we have 
1 
Í f 4 dt 
LIMPIO 


2 
ao = Í |e |? ae 
gi 


li 


dé 


gn 
= 0. 


That is, the constant function on [0, 1] does not have a square integrable local 
time! The reason for this is reasonably obvious. The occupation measure 
v(u, I,) of the constant function is simply 


0 ifu <c, 
Mud. = $ ifu >c. 


Since v is discontinuous, it cannot have a density at all, let alone a square in- 
tegrable one. 
Now consider the linear function F(t) = t, t € [0, 1]. For this 


’ 


1 
Í eF O dt 
0 


This function is integrable over #', but its square root is not. Thus Theorem 
8.6.2 tells us that F has a square integrable local time, which may not be con- 
tinuous. In fact, it is not continuous, since for this function 


2 e? —1 


ig 


?  X1-— cos 0) 
— g2 " 


0, u< 0, 
vu, [0, x) 214 O<uK<x, 
X, ux, 


implying that the local time is given by 


0, u< 0, 
ulu, [0,x])= 1, O<u<x, 
0, u >x, 


which has a discontinuity at u = x. 

To produce examples of functions with square integrable, continuous, local 
times is a somewhat more difficult task, for the basic principle underlying local 
time methodology is that smooth functions possess badly behaved local times, 
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if they possess local times at all, while smooth local times arise from functions 
exhibiting a high degree of erraticism. Since highly erratic functions are complex 
to produce, other than as sample functions of erratic Gaussian processes, we 
shall wait until we have returned to the sample function setting to provide 
examples of the latter nature. 

Having set up the notion of local time, we shall now proceed as follows. The 
following section will justify our expectations that the knowledge of the local 
time of a function enables us to quantify, through Hólder conditions and di- 
mension theorems, the function's erraticism. Section 8.8 will apply this method- 
ology to the study of Gaussian sample functions, where we shall be able to 
produce (random) functions with smooth local times. Finally, in Section 8.9, we 
shall take a close look at two very special Gaussian fields. 


8. LOCAL TIME, ERRATICISM, AND DIMENSION 


Our first task is to show that the possession of a smooth local time indicates 
that a function is highly erratic. To do this, we must firstly specify precisely what 
we mean when we say that a local time u(u, B) is ‘smooth’, particularly in view of 
the fact that y is a function of two parameters, one of which ranges over the Borel 
sets of I,. The most useful way of doing this is to consider one parameter at a 
time and, furthermore, restrict the set parameter B to the class of cubes in I, rather 
than letting it range over all Borel sets. When we do this we can introduce the 
following definition. 


Definition 8.7.1 


Let F: I, c 4" > 4" be LT with local time u(u, B). Then we say that p satisfies 
a uniform (in u) Hólder condition of order «, 0 < a < 1, in the set variable if there 
exists a finite constant M such that, for every u, 


(8.7.1) uu, B) < M[Ay(B)}* 


whenever B is a cube in I, of sufficiently small edge length. Similarly, we say that u 
satisfies a uniform (in B) Hólder condition of order a, 0 < « < 1, in the space 
variable if 


(8.7.2) [u(u, B) — u(v, B)| < M||u — v||* 
for every u, v, some finite M, and ||u — v|| sufficiently small. 
If a function's local time is 'Hólder smooth’ in either of the above senses, then 


this has major implications for the Hólder conditions that the function itself 
can satisfy. For example, we have the following theorem. 
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Theorem 8.7.1 


Let F: 1: 4" — & be a continuous function, possessing a local time satisfying 
a uniform Holder condition of order « in the set variable. Then nowhere in I, do all 
the coordinate functions of F satisfy a Hólder condition of order greater than 
N(1 — o)/d. 


Proof 


We shall prove the theorem under the added (but unnecessary) assumption 
that y(u, t) is jointly continuous, so that we can call on Theorem 8.6.1. As a 
first step we shall establish the existence of a finite C > 0 such that, for every 
cube B in # (the rational intervals in I,) of small enough edge length h, 


(8.7.3) max 


sup F(t) — inf F,(t) | > Ch- oa, 
B B 
From the definition of local time we have that, for any cube B of edge length h, 
h^ = A (B) = Í u(u, B) du. 
Ra 


Theorem 8.6.1 tells us that u(u, B) vanishes outside the closure of the image of 
F(t), t € B, so that the above equation implies 


Mı Ma 
w=f zi u(u, B) du, 


where m; = infg F,(t), M; = sup, F(t). But applying the Hólder condition to the 
integrand, we find, on allowing h to be as small as necessary, that 


d 
n" < H (M; — mi) x M[A(B)]* 


d 
< n(x IM; — ni) 


which immediately implies (8.7.3). 
To complete the proof we must show that, for every t e J,, B > N(1 — ad, 
and every K > 0, 


(8.7.4) max sup{|F,(s) — F,(t)|: lls — t| < hj 2 Kh? 


if h is small enough. But for any h > 0 we can find a cube B containing t, be- 
longing to #, and with an edge length at least 2(1 -- eyh//N, where « > 0 is 
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arbitrary, such that B c (s: ||s — t|| < h}. By the triangle inequality, the left- 
hand side of (8.7.4) is at least 


4 max | sup Fs) — inf F,(s) |, 
i B 


which, by (8.7.3), is at least C[2h(1 — £)]"! "9 for h small enough. This im- 
mediately implies the validity of (8.7.4), and we are done. 


Let us take a moment to consider exactly what it means for a function to fail 
to satisfy a Hólder condition of order ff, 0 < B < 1, everywhere. To facilitate 
our description, let us assume for the moment that we are dealing only with a 
real-valued function, F(t), defined on [0, 1]. Then we immediately see that the 
function has no derivative, anywhere in [0, 1]. The existence of such a derivative 
hinges on the existence of limits of the form lim, | o [F(t + h) — F(t)]/h, and 
these cannot exist, since the ratio here is at least equal to [F(t + h) — F(t)]/P, 
0 « y < 1, and this diverges as h | 0 for any y > f. 

But much more than this is true. Indeed, if at each point (t*, F(t*)) on the 
graph of F we place a cone with vertex at (t*, F(t*)) and boundaries growing like 
hf, it follows from the lack of a Hólder condition that everywhere along the 
graph of F its trajectory will always fail, near the vertex of the cone, to lie within 
it. 

This description of the lack of Hólder conditions can also be applied, in 
essence, to functions on 4?" taking values in 2°. For example, we immediately 
have that in this case the coordinate functions will not possess all their first- 
order partial derivatives. The cone argument also carries over to this case in an 
obvious fashion, although it now, of course, becomes extremely difficult 
(especially if N + d > 4) to visualize the corresponding phenomenon. 

In much the same way that we have used Hólder smoothness in the local 
time's set variable to generate nowhere Hólder conditions for the function 
itself, Hólder smoothness in the space variable leads to similar results. For 
example, although we shall not prove it here, it is possible to establish the 
following, a special case of Theorem 11.1 of Geman and Horowitz (1980). 


Theorem 8.7.2 


Let F: I, c 4?" > 9?! be a continuous function, possessing a local time satisfy- 
ing a uniform Holder condition of order « in the space variable. Then at every t* 
which is a local maximum, or minimum, of F we have that, for any B > N/(d + a) 
and every K > 0, 


(8.7.5) sup{ | F(t) — F(t*)|: |t — t*|] < h} > KA" 


if his small enough. That is; the function does not satisfy a Holder condition of order 
[! at t*. 
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The importance of this result is that for a very large class of Gaussian (N, d) 
fields, which includes both the index-« fields and the Brownian sheet, it can be 
shown that the set of local maxima and minima of their sample functions is 
almost surely dense in I, (see Tran, 1976, and Pitt and Tran, 1979.) Hence, 
since these fields also possess almost surely continuous sample functions, it 
follows that (8.7.5) holds, for these fields, for every t* € I, and every i, with 
probability one. However, since it is a non-trivial exercise to establish the 
aforementioned denseness, we shall not go into this any further here. 

Let us now return to the erraticism of primary concern to us—that exhibited 
by the level sets F^ !(u). As before, we shall obtain the dimension of the level 
sets in two steps, by finding upper and lower bounds which, under appropriate 
conditions, are identical. We start with the upper bound first. 


Theorem 8.7.3 


Let F: I, c 4?" > R’ satisfy a uniform Hélder condition of order a, 0 < a; < 1 
on I,, and possess a jointly continuous local time (u, t). Then, for every ue 4, 


d 
(8.7.6) dim F^!(u) x N — Ya, 

i-1 
if N — M: a; > 0. 


This theorem gives a strengthening of the result of Lemma 8.2.2, where (8.7.6) 
was obtained for almost every u. There, however, no assumptions were placed 
on the local time. 


Proof 


For each n > 1 and each integer lattice point i = (i,,..., iy) with 1 < i; < 2" 
for each j set, 


(8.7.7) By (te hilt- (ij — 1277| < 270* 5,4 = 1,...,N}, 


N 1/2 
(8.7.8) B% = ‘ El: |» Ires wr « /NQ + piu 
j=l 


Then B,, c BÀ and the B5 are open spheres. Furthermore, the spheres Bj, for 
which at least one point of the level set F^ !(u) lies in the corresponding cube 
B,, form a covering of F^ !(u). If 0 < B < N, then the sum of the fth powers of 
the diameters of these BA is proportional to 


(8.7.9) (2n !) "7 x Ii: F(t) = uforsomete By}. 
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Since F satisfies a Hólder condition of order a on I,, there exists a finite M > 0 
such that, for each j = 1,...,d, 


|F — F(sS| < Mlt — sl”, for s, t € Bm, 
for large enough n and any i. Thus, for large enough n, (8.7.9) is not larger than 
(8.7.10) 
Qn 1) **? x x {ii [F0 — u| < A-27"5, j= 1,..., d, and all te By}. 


where A = sup; M(,/'N)%. 

We shall now show that f > N — $5., a; implies that (8.7.10) tends to 
zero as n > 0. From the definition of Hausdorff dimension this is sufficient to 
prove the theorem. We shall adopt a proof by contradiction. Thus, suppose that 
B >N — Y4.,4;,>0 and (8.7.10) does not converge to zero. Then there 
exists a y > 0 whichis exceeded by (8.7.10) for infinitely many n. Thus the occupa- 
tion measure v(D,(u), I,) of the set 


d 
D,(u) = [[(u — A-2775, uj + A-27™) 
j=1 


is at least n(2 + n^ !y? . 27"" for infinitely many n. In terms of the local time this 
means 


Q 4 n ie Bre uv, I) dv > 4 


Dn(a) 


for infinitely many n. But this contradicts the continuity of u, which implies 
that the left-hand side of the above inequality is not greater than 


(2A)! max yv, I,) 27 "Ej- 10782 4. nA) +B 


4 1\ 
= (2A)! max u(y, I,) - poaa = in) Q — n1)? 


which clearly tends to zero as n — oo as long as $ > N — } 4- a; > 0. This 
completes the proof of the theorem. 


We can now turn to the problem of determining a lower bound to the di- 
mension of the level sets, as in the following theorem. 


Theorem 8.7.4 


Let F: I, c A“ —> 4!" be continuous and LT, and suppose that its local time 
satisfies a uniform Hólder condition of order a in the set variable. Then 


(8.7.11) dim F !'(u) > Nx 


for every u not contained in the set of zeros of pu, L). 
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This is, at first sight, an extremely unusual result, insofar as it links the u for 
which (8.7.11) fails to the zeros of the local time. Its importance lies in the fact 
that, in general, it is surprisingly simple to obtain information about this zero 
set, both in the current setting (Lemma 8.7.2) and in certain random situations 
(Lemma 8.9.3). 

In order to prove Theorem 8.7.4 we require firstly the following lemma. 


Lemma 8.7.1 


Let u(-) bea non-negative, finite measure on the Borel sets of I, < R”, satisfying 
a Hólder condition of order «, 0 < « < 1, over all cubes in I,. That is, there exists 
a finite M > O such that for all cubes B c I, of sufficiently small edge length 


(8.7.12) WB) < M[Ay(B)I*. 

Then if A is a Borel set in I, and dim A = f, where 0 < B < Na, 
(8.7.13) (A) = 0. 

Proof 


Since dim A = fj, it follows from the definition of Hausdorff dimension that 
for every f > f and every n > 1 there is a covering of A by open spheres S,,, of 
diameter d,,, k = 1,2,..., such that d,, < n^! for each k and 


lim Y (d, < oc. 


n2o k=1 


From this we have that if Na > f' then 


(8.7.14) lim X (dm) = 


n>% k= 
Since each $,, can be covered by a cube of side length J/N d,,,, we can exchange 


the spheres by cubes B,, of edge length JN dix» Whose union also covers A, so 
that (8.7.14) will still be satisfied. Now recall that f < f' < Na. Then since 


u(A) < lim sup X HBr) 


n> 


and, by the Hölder condition (8.7.12), we have that the right-hand side is 
bounded by 


MCG/NY* lim sup Xa". 


noo 


it follows from (8.7.14) that (A) = 0, as claimed. 
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Proof of Theorem 8.7.4 


We are now in a position to prove Theorem 8.7.4. We shall again make the 
(unnecessary) added assumption that the local time is jointly continuous, so as 
to enable immediate application of Theorem 8.6.1. Let u be a level where (8.7.11) 
fails. Thus dim F~'(u) < Na. By Theorem 8.6.1 the support of the local time 
Hu, -) is contained in F^ !(u). But since this set has dimension strictly less than 
Na it follows from Lemma 8.7.1 and the Hólder condition on u that 


uu, F^ (u)) = ju, L) = 0. 


That is, u belongs to the zeros of u(-, I,), as we were required to show. 


Since it is now clear that the zeros of the local time have an important role 
to play in dimension problems, we turn our hand to finding out a little more 
about this set. For this purpose, recall that a subset A of a set B c Æ" is said to 
be nowhere dense in B if its closure A in B contains no non-trivial spheres. 
Equivalently, A will be nowhere dense if the interior of A is empty. Clearly 
this is also equivalent to saying that the closure of its complement is dense in B. 
Such sets are easily seen to have zero Ay measure, although they need not have 
zero dimension. An example is afforded by the Cantor ternary set, which is 
nowhere dense in [0, 1] but has positive dimension (Theorem 8.1.2). 


Lemma 8.7.2 


Let F: I, c 4" > #4 be continuous and LT, with a local time u(u, B) con- 
tinuous in u for each closed interval B c I,. Then the set of zeros of u, 


(u: uu, B) = 0j, 


is nowhere dense in the image of F. 


This is an important result for, in conjunction with Theorem 8.7.4, it yields 
that if the conditions of that theorem are in force then dim F^ !(u) > No for 
almost every ue Im F. This opens up the possibility of obtaining the uniform 
dimension results discussed in Section 8.4 (cf. 8.4.22). 


Proof 


To prove the lemma we proceed as follows. Let F(B) be the image of F over 
the closed set B. Since F is continuous, F(B) is closed. Furthermore, since p is 
continuous its zero set ts also closed. Hence we need only show that the zero set 
contains no non empty open spheres of F(B). But if there was such a sphere, S 
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say, so that (u, B) = 0 for all ue S, it would follow that the open set F^ (S) 
would have measure zero, since, writing Is for the indicator function of S, 


AKF- 8) = Í Is(F() dt 


Io 


= [io B) du 


by Lemma 8.6.1. But since F^ '(S) is open, this implies it must be empty, which 
contradicts the original assumption that S was a sphere of positive measure in 
the image of F. Thus no such S can exist, and the lemma is proven. 


With this result we complete our study of the local times of non-random 
functions. In the following section we shall apply this information to the study 
of Gaussian fields. 


88 THE GAUSSIAN CASE 


Throughout this section X(t) = (X,(t),..., X,(t)) will denote a zero-mean 
(N, d) Gaussian field on I,, with each X; continuous in probability. Our first 
task is to determine when such a field will be LT, in the sense that its sample 
functions will be LT with probability one. This will, not surprisingly, depend on 
the covariance matrix X(s, t) defined in Section 8.4, whose diagonal elements are 
the incremental variance functions c?(s, t) = E{|X,s) — X,(t)|?}. Setting 


(8.8.1) AG, t) = det X(s, t) 


we can prove the following theorem. 


Theorem 8.8.1 


If X is as above and 


(8.8.2) Í Í [A(s, t)]7 !? ds dt < oo, 
Iov Io 


then X(t, w) is LT. Furthermore, for an @ set of probability one there are local 
times u(u, B, w) which are square integrable (as functions of u). 


Proof 


According to Theorem 8.6.2, we need only show that 


(8.8.3) [. 


2 
f exp[ið - X(t)] dt| dð < o 
J.. 
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with probability one, for which we need only show that the integral has a 
finite expectation. But the expectation is simply 


Í Í Í E{exp[i@ - (X(s) — X(t))]} ds dt d0 
914 JIS s Io 


- [. Í f exp[ — 19s, t)07] ds dt de 
by (1.6.4). Integrating out 0 yields that this is equivalent to 
(2x)? a Í [AG, t)] 7+ ds dt 
which is finite by hypothesis, aid $e are done. 


It can also be shown, although we shall not do so, that (8.8.2) is necessary, as 
well as sufficient, for the conclusion of this theorem to hold. 

The basic content of Theorem 8.8.1 is simple to understand at a heuristic level. 
Suppose that the X; are independent, so that E is diagonal and A(s, t) = 

2 10s, t). Then (8.8.2) implies that the o,(s, t) increase from zero reasonably 
rapidly as t moves away from s, implying that there exists little correlation 
between each X (s) and X(t), even when ||s — t|| is small. Thus one might expect 
that the sample functions of fields for which (8.8.2) hold exhibit substantial 
erraticism. That this expectation is correct will be borne out by the forthcoming 
analysis. 

We already know, from the preceding section, that to study dimensionality 
for fields we shall require much finer information on local time other than mere 
square integrability. To obtain this information—essentially almost sure 
Hólder conditions and joint continuity—in the present random situation, we 
must, for each c, construct a local time, which, for almost all œ, is well behaved. 
The construction proceeds as follows. Fix a level u € 2° and a Borel B c I. 
Let 


(8.8.4) u(u, B) = lim inf 


n> oo 


1 
gos | Hoo. am(IX(1) — wl) at 
d B 


where B,, as usual, represents the volume ofunit sphere in Z^ and I is an indicator 
function. For each u and B it is easy to see that this is essentially the same limit 
as taken in the construction for Theorem 8.4.2 except that here we are replacing 
(purely for mathematical convenience) the Gaussian density of (8.4.15) with a 
uniform density in the sphere o(n^ +). Since u and B are fixed in (8.8.4), it is 
straightforward to argue that under appropriate conditions the limit exists in 
(8.8.4) both in the kth mean and with probability one. 

Having thus constructed y(u, B, œ) for each u, B, and almost all «o, we would 
like to let u and B vary to obtain a full local time function yf- e»). This, 
unfortunately, is not entirely straightforward. For example, it is not clear that the 
ei set where (8.8 4) fails to exist as an almost surely limit is independent of u and 
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B. Thus, taking the (uncountable) union of these exceptional sets over all u 
and B may yield a set of positive probability, indicating that a local time process 
B^, +, œ) could not be constructed in this fashion. Since the path around these 
obstacles involves measure-theoretic probability outside the scope of this book, 
we shall content ourselves, in Theorem 8.8.2, with merely stating conditions 
under which this construction will work and, moreover, yield a jointly con- 
tinuous local time. To describe these conditions, we shall firstly assume that 
X(0) = 0. This simplifies the notation and, since we are dealing only with local 
properties of sample functions, is not a severe restriction. We write X:!/*(s, t) for 
the non-singular matrix determined by X!/*(s, t)[E!?(s, t)]" = Xs, t), s # t, so 
that A(s, t) = det[E'(s, t)]?. Then writing V(Y) for the variance of a random 
variable Y and <u, v» = [X(u;v;?]!?, we can introduce the following concept 
necessary for the statement of Theorem 8.8.2. 


Definition 8.8.1 


An (N, d) field X is called locally non-deterministic, or is said to be LND, if. 
for each k > 2 and non-zero u,,... uy, uj € 2^, there exist constants C > 0 and 
ô > 0 such that 


k 
(8.8.5) z Y. Xu, Z; (X(t) — X(tj..,) | >C 
j=1 
whenever the distinct points t,,...,t, all lie in a cube of edge length at most 6 


and satisfy 
(8.8.6) |t; — tl xit. —t] forall <i<j<k, 


where to = 0 and X; = X "*(t;, t;..). 


Since the intuitive content of (8.8.5) is certainly not transparent, we shall take 
a moment to discuss it and provide examples of LND Gaussian fields. Firstly, 
note that the inequalities (8.8.6) are automatically satisfied when N = 1 and 
ti € c: «t. In general, for t,,...,t, in I,, there is a permutation (j,,...,j,) 
of (1,...,k) such that t,,,..., tj, satisfies (8.8.6): choose ją = 1 and then j,_, to 
satisfy |t; , — tj] € |t; — tjl for all j # jx, etc. Thus, it is not hard to see 
that (8.8.5) implies X must have ‘almost orthogonal increments’, for if we took 
in (8.8.5) a sequence t?”, . . . , t(? of points satisfying (8.8.6) and 


lim max ||t(? — ("|| = 0, 

noo i,j 
the variance in (8.8.5) would approach zero as n > œ unless a phenomenon akin 
to orthogonality of the increments X(1) — X(t^,) for each n and j was 
occurring. 
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When N = litisnot hard to show (Berman, 1973) that for Gaussian processes 
local non-determinism is equivalent to 


V(X(t) _ X(t, | X(ti)..., X(tk-1) 
(87) — lim inf. VIX) — X(t, 1) 5 


for every k > 2andt, < ++- < t,. Since the expression after the lim inf is merely 
the ratio of conditional to unconditional variance of the increment X(t,) — 
X (t, 1), it follows that under (8.8.7) this increment is relatively ‘unpredictable’ 
based only on knowledge of the data X(t,),..., X(t,_,). This is despite the fact 
that all the points t,,..., t, are close. Hence the term ‘local non-determinism’. 

For an example of an LND field, consider the (N, d) Gaussian field X whose 
componement fields are independent with covariances 


0 


(8.8.8) E{X(s)X{t)} = eflls? + [t^ — le- sl} — 0B; <1. 


This is clearly an index- field, for which we have the following lemma. 


Lemma 8.8.1 
For any € > 0, the Gaussian field described above is LN D on 
T = {te *:e« |tl < e^). 


The proof of this result relies on the following lemma. 


Lemma 8.8.2 


For an (N, 1) Gaussian field with covariance (8.8.8) there exists a strictly 
positive constant C = C(N, ) such that for any t € R` and positive r < |t|| we 
have 


(8.8.9) V{X(t)| X(s): |t — sl] > r} = Cr”. 


Proof 


We shall give only a partial proof of (8.8.9), establishing the existence of a 
C > 0. Fourier analytic arguments can be used to show C > 0. For details, see 
Pitt (1978). 

Since X(0) = 0 the case t = 0 is clear. If t # 0, set t* = (r/||t||)t. Then t* lies 
on the line joining the origin to t, and the point t — t* lies at the intersection of 
this line with the set {s: |t — s|| = r}. Thus 


VIXQ) X(G) is t > ri 
VINO) Xa COLAS NX) s toco nw the rt, 
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where we have used the fact that X(0) — O enables us to (apparently) change the 
conditioning event. Applying the fact that X has stationary increments we have 
that the above variance equals 


(8.8.10) V(X(t*) — X(0)| X(s) — X(u): |s — t*| > r, Ju — t*|| > rj 
= {X(t*)| X(s: |s — t*]| > r} 
V{X(t*)| X(s): Is — t*|| 2 [I 
since ||t*|| = r. Now we can use the fact that X is scale invariant, in the sense that 
for any real y > 0 the field Y(t) = |y|~*X(yt) has precisely the same covariance 


function as X. This immediately implies that the final variance of (8.8.10) must 
be of the form C|[t* |? = Cr? for some C > 0, and the proof is complete. 


Proof of Lemma 8.8.1 


We now turn to the proof of Lemma 8.8.1. Suppose the conclusion is false; 
ie. X is not LND on T,. Then there must exist an integer k > 2, d vectors 
u ...,u® not all zero, and a sequence (t,..., tt} = {t,,...,t,} with each 
t; € T; and |t;,; — tj| < lt; — ti| for 1 < i € j € k, and such that 


kod 
ži » uP LX j(t;) — XKti...)]lt; — t; ll u 


i=1 j= 


converges to zero as n > oo, where we take t; = 0. Because of the independence 
among the X; this also implies 


k 
vf uPEX (td) — X f(t; ))llt; — var) 230 
i-1 


for each j. Assuming, without loss of generality, that uf? # 0, and temporarily 
dropping the subscript j, we have from Lemma 8.82 that 


Zp uX) — X(t. )]lt; — t- ll J 


(uY . 
mo 34 V{X(t,)| X(s): t, — sl] 2 min((|t,, t; — tz- D} 
It, — t, l° 
(u 9)? . 
27> C[min(|It,l, |t; — t, 1]. 
|t, — t, .,1? 


Sinceu™ # Oand ||t,|| > ethisis bounded away from zero. But this is a contradic- 
tion, so that X must be LND on T, as claimed. 


Before we leave this example it is worthwhile to note that the unfortunate 
condition that the T; be bounded away from zero is necessitated by the fact that 
X(0) = 0. Thus, for example, the field Y(t) = X(t)+ Z where Z is a d-di- 
mensional vector of independent Gaussian variates, also independent of X(t) 
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for all t, is LND on the whole of Æ”. It is also possible to avoid this problem by 
a slight alteration of the definition of local non-determinism, which we shall not 
bother with here (for details, see Pitt, 1978). 

With the concept of local nondeterminism now established, we can state the 
following result, in which, for each k > 2, 0 < y < 1, and Borel B c I,, we write 


k 
(8.8.11) V, (B) = fo f gaea, t; ,) dt, --- dt,. 


Theorem 8.8.2 


Let X be LND and suppose V, ,(1,) < oo for some 0 < y < 1 and even integer 
k > d/y. Then X is LT and possesses a local time u which is jointly continuous on 
R? x I, with probability one. Moreover, there exist constants C}, C4 > 0 such 
that, for every Borel B c I,, 


(8.8.12) E(|u(u, B) — p(y, B) < C, lu — vl V, (B), 
and 
(8.8.13) E{|1(0, B)} < CoV, (B). 


As we have already noted, we shall not attempt to prove this result here. 
(For a proof, see Geman and Horowitz, 1980.) However, we shall note that the 
proof proceeds by taking the u(u, B) of (8.8.4) and extending this, initially to a 
process on almost every u and all B € #. Equations (8.8.12) and (8.8.13) arise 
by taking moments of expressions based on (8.8.4), and then this process is 
extended, via continuity and separability, to a process on all u and all Borel B. 

We now turn to the problem of determining stochastic Hólder conditions, in 
the set variable, for Gaussian local times. Let 2, = {I,}, and for each n > 1 
let 2, be the family of 2"" ‘dyadic cubes’ in 1,, each of measure 2~"" obtained by 
successive subdivision of I,. Also, put 2 = (J 51 Pn 


Theorem 8.8.3 


Let X be LND and suppose that for some 0 < y < 1 and even k > d/y there is 
a ô = ó(k, y) > 1 for which 


(88.14) - V. (B) < C,[A4(B)]? 


for every B € Z,, n È no, for some constants C, and ng. Then for each compact 
U c #4 and each a < (6 — 1)/k there is a constant C, and a random variable 
e = e(w) such that, with probability one, the local time satisfies 


(8.8.15) nu, B) < C,[Ay(B)}* for allue U, 


for any cube B « 1, of edge length less than c. 
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Proof 


We begin by noting that the assumptions of Theorem 8.8.2 are in force, so 
that the conclusions of that theorem are valid. Thus, in particular, we have that 
the local time is jointly continuous. In what follows, C, C, will denote constants 
which may change from line to line, but do not depend on B. Fora > 0, Be X, 
and compact U e Z^, define the quantity 


y(u, B) — 
ult "a 


Aa dv. 


(8.8.16) K(B, U; k, a) = ‘a 


Then, by (8.8.12), 
(8.8.17)  E{K(B, U; k,a)} < CV.) Í Í |u — v|*? 9 du dv 
UJU 


which is finite as long as (x — y) < d/k. That is, if (a — y) « d/k, we have that 
K(B, U; k, a) is finite with probability one. This allows us to apply Lemma 3.3.3, 
extended from I, to compacts U, identifying the function f(-) there with y(-, B) 
here, to obtain that, with probability one, 


| w— v Il 
(8.8.18) |u, B) - (v, BI < C if [K(B, U; k, ay] x7 24x71 dx 
= C, [K(B, U;k, 2)]!^|u _ v||*- 24. 


for all u, v e U, where C, = C,(o)is independent of B. 
Now let B,, B;,... be the sets in 2, where B, = L, 2, = (Bj,..., Bassi). 
etc. Also, let ó* < ô — 1 and write A, for the event 


A, = {K(B,, U, k, y) = [4(B,)] 3. 
Then by (8.8.17) and (8.8.14) and Markov's inequality (1.3.1) we have 


P{A,} € [AB,)] "EC KG, Ui ky) 
< CABAI”. 


Hence 

D P(A) <C » DPN ea EE 

n=1 m=1 
since 6 — 6* > 1. Thus, by the Borel-Cantelli lemma, there exists an almost 
surely finite random variable N; (œ) such that, with probability one, 

K(B, U; k, ô) < [A«(B)]" 

for all B € 2,,n > N,. Thus (8.8.18) immediately yields (with « = y + d/k — n) 
(8.8.19) |w(u, B) — u(y, B)| < C[As(B) "u — vj» t" 
for all u, ve U, all Be Y,,n > Ny, and any y > 0. 
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Similarly, applying (8.8.13), (8.8.14), and Markov's inequality, we find 


oo 


Y. Plu, B,) > [ABI < C È ABDI” < o, 


so that, arguing as before, there exists a random variable N (œw) such that, with 
probability one, 
(8.8.20) (0, B) < [A«(B)]" ^ 


for all B € 2,, n > Nj(o). 

Now let us take U = [0, 1]* for simplicity, and assume that the random point 
o is in all the sets of probability one mentioned above. If u € U, we may write u 
in a dyadic expansion 


oo 
u- $ Huh... uO), uP = OorL. 
j=0 


Thus, since x(u, B) is continuous in u for every Be 2€, 


u(u, B) = u(0, B) + » PROS «cos Mf) d 


n—1 
— u| 2 27 (uL. u), al. 


j-0 


If B € 2,, and n > N* = max(N,, N2), then (8.8.19) and (8.8.20) yield 
uu, B) < [A4(B)"* + [ABI] C Y, G/N 277-47" 
n=1 


for every y > 0. Since y > d/k by hypothesis, by choosing 0 < n < 4(y — d/k) 
we can make the summation finite, so that 


(8.8.21) u(u, B) x C[Ay(B)]?" 


for small enough B € 2, thus proving (8.8.15), and so the theorem for dyadic 
cubes. 

Now let B c I, be any cube of edge length e(B) less than e(w) = 2^ "€. It 
is easy to see that B can be covered by at most Ly — 16" dyadic cubes 
Ci, ..., Cz, of edge length no more than e(B). Thus each C; € | Jv. Dy, and 
An(C;) x An(B), so that 


LN 
E(u, B) < Y y(u, Ci) x CL, [A«(B)]"* 
iz 
by (8.8.21), which completes the proof. 


We are now finally in a position to be able to apply local time techniques toa 
specifie dimensionality problem for Gaussian fields. Before we do this; however, 
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let us note that all the results on local time that were established for functions or 
fields defined on the unit cube of # are also valid when the parameter set is 
any compact set in 2”. This point is relevant to the proof of the following 
theorem, which deals with a field that is not LND on 1, or, indeed, on any set 
which contains the origin, but is LND on any compact set bounded away from it. 
We shall use this more general version of our earlier results without further 
comment in proving this theorem. 


Theorem 8.8.4 


Let X be an (N, d) Gaussian field whose component fields are independent with 
zero means and covariance functions 
(8.8.02) E(X(sS)X(t)) = $ctlsl^^ + tl? — ls e]?^, | 0f; «1. 


Then if € > 0 and L, denotes the level set 1t € I,: ||t| > £, X(t) = u} and Im X = 
(u:u = X(t) for some t € I, with |t| > £}, then 


d 
paimt. = N - Y. B; for all u e Im X 
i-1 


iz 
except for an at most nowhere dense st} =1. 


Proof 


By Lemma 8.8.1 the field X is LND on I, = {te I: ||t|| > £}. To apply the 
previous theorems we need to estimate the quantity V, ,(B) defined by (8.8.11). 
As before, we shall use C to denote a positive constant that may change from line 
to line. Let S, be the sphere of radius r centred on some point t* € I,. Then 


d 
KS) = [ [Ts — ty [2824 


Spi=il 


where f* = $2, B;. Since, by a transformation to polar coordinates, we have 


Í |t — s7" dt < f lt — s^ * dt = CANT”, 
Sr Sor 


it immediately follows that 
V, KS) « CQr)^- 28*(1/2 + y)]k 
= C[AN(S,) ]7 1— BY + 2y)/N] 
It is clear that this inequality is also valid if we replace S, by a B € 2, so that 


condition (8.8.14) of Theorem 8.8.3 is in force for each 0 « y < 1 and k > d/y, 
with 


* Y 
ó —Xky)- [i d n | 
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provided this is positive, and this will certainly be the case if y is small enough. 
Hence, (8.8.15) works for each 


* 
ve sup OD, PUED 


k:k diy N 


Letting y | 0 now yields (8.8.15) with x = 1 — f*/N. Thus the local time satisfies 
a uniform (in u) Hólder condition in the set variable of every order less than 
1 — f*/N. Since Im X is compact (by the continuity of X) we have that, with 
probability one, 


(8.823) Hu, B) x C[A«(B)]* 


for all u € Im X, all sufficiently small cubes B c I,, and alla < 1 — B*/N. 

Furthermore, by Theorem 8.8.2 the local time is jointly continuous. Theorem 
8.32 tells us that X satisfies a stochastic Hólder condition of order æ for every 
a with a; < f, so that by Theorem 8.7.3 


d 
(8.8.24) dim La < N — Ya; 
i=1 
for every u with probability one. But (8.8.23) and Theorem 8.7.4 imply that, with 
probability one, 


d 
(8.8.25) dim L, 2 N — Y fj, 
i-1 
for every u not contained in the set of zeros of u(-, I). By Lemma 8.7.2 this set 
is at most nowhere dense in Im X, which, with the above two inequalities, 
suffices to prove the theorem. 


Finally, we note that if X satisfies the conditions of this theorem, then by 
Theorem 8.7.1 nowhere in J, do all the coordinate functions of X satisfy a 
Hólder condition of order greater than 3. , fj;/d. Thus, for example, if f, = --- 
= pf, = P, we have that, for each yn > 0 and small enough h > 0, 


(8.8.26) max sup{X,(s) — X(t): |s — t| x h} > Kh?*" 
for every t e J, and every K < oo, with probability one. 
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We opened this chapter with a discussion of the Brownian sheet on the plane, 
and used its sample function erraticism to motivate our introduction of the 
notion of Hausdorff dimension. However, although we have investigated the 
erraticism of many types of Gaussian fields, we have not as yet obtained any 
results for the Brownian sheet. Hence we shall now look more closely at this 
case. 
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Let us commence by recalling that, in one dimension, the Brownian motion, or 
Wiener process, is the zero-mean Gaussian process on Z1. = (t:t > 0} with the 
covariance function 


(8.9.1) Hrts—|s—tl}. 
A little algebra shows that this can be rewritten as 
(8.9.2) s At = min(s, t). 


When we come to generalizing this process to a field on the orthant 4/4 = 
(te 42^": 2 06i L..., N} in Æ, these two equivalent representations of the 
covariance functions generalize to two quite different fields on 2Y. The first 
is the so-called isotropic, or Lévy, Brownian motion, B(t), with zero mean and 
covariance 


(8.9.3) R,(s, t) = $(Isll + [ltl] — Ils — tll}, 


while the second is the multiple-parameter Wiener process, or Brownian sheet, 
W(t), with zero mean and covariance 


N 
(8.9.4) Ry(s t) = [](s; ^ tp. 
i=1 

It is easy to check that Rp and Ry are not the same, so that B and W are quite 
distinct fields. Both have an important role to play in the theory of random fields. 
For example, for the isotropic field, increments of the form B(t) — B(s) are 
stationary, having an N(0, |t — s||) distribution. Such increments are not gener- 
ally independent, however, unless increments of the form B(at) — B(bt), 
B(ct) — B(dt), a > b > c > d, are taken. (This independence is a manifestation 
of the fact that if the field is restricted to any ray in ZY starting at the origin then 
the resulting process 1s simply one-dimensional Brownian motion, as is easily 
checked from the covariance function.) The isotropic motion has had an im- 
portant role to play in model-building applications of random fields, and is 
typical of the type of field used by Mandelbrot (1975a, 1975b) to model geo- 
graphical phenomena and turbulence, and discussed in more detail in Mandel- 
brot (1977). 

We have already studied this process in some detail, for B(t) is simply an 
(N, 1), index-3, Gaussian field. Thus the results of the preceding sections provide 
virtually all the information we are interested in, as regards its sample function 
erraticism. For example, Theorem 8.3.2 yields that B(t) satisfies a uniform 
stochastic Hólder condition of every order less than 4, while (8.8.26) states that 
nowhere in I, does B(t) satisfy a Hólder condition of order greater than $. 
Theorem 84.1 gives the dimensions of its image and graph, respectively, as | 
and N + 3, while Theorem 8.8.4 sets the dimension of almost all its non-empty 
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level sets at N — 1. Thus there remains very little reason for us to subject this 
field to further study. 

The situation is quite different, however, when we turn to the Brownian sheet, 
W(t). This process arises in a natural fashion in a variety of statistical situations, 
generally as a result of weak convergence arguments. (See, for example, Pyke, 
1972, for details and further references.) The Brownian sheet does not have 
stationary increments in the usual sense. However, it can be viewed as a random 
measure on 2X by writing the increment of W over A as 


(8.9.5) W(A) = fawo, 


where A is a subset of 2Y and the integral is appropriately defined. Then (8.9.4) 
and (8.9.5) easily yield 


E{W(A)W(B)} = Ay(A ^ B) 


so that W(A) and W(B) are independent if A and B are disjoint, and the variance 
of the increment W(A) is simply 4,(A). Thus, viewed in this sense, W can be 
said to have stationary and independent increments. Because of this property 
the Brownian sheet plays a central role in the theory of multiparameter mar- 
tingales, analogous to that taken by the standard Brownian motion in the one- 
dimensional case. (See, for example, Wong and Zakai, 1974, 1976, and Cairoli 
and Walsh, 1975, for details.) 

If we look at the increment W(s) — W(t), this is easily seen to have the 
variance 


(8.9.6) Ay[(A(s) ^ A*(0) v (A*(s) ^ A(0)] 


where we write | |, [0, 4] = A(t). This is not of the form ||s — t||f for any f, so 
that W is not an index-f field. Thus we cannot apply our results on index-f 
fields to the Brownian sheet, and so we must start from scratch in investigating its 
erraticism. What we shall show, however, is that W behaves just like B as far 
as its local behaviour is concerned. To do this we start with two lemmas which 
give a preliminary indication of why this might be the case. Defining 


N N N 
(8.9.7) ó(s, t) = IIs + I [a - 2 [(s; ^ tò 


i- 


we have the following lemma. 


Lemma 8.9.1 
For any s. tc L, 


(8.9.8) EWS) Wot 96807 IN € s 
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Proof 


The equality is an immediate consequence of (89.4) or (8.9.6). Writing 
max(u, v) as u v v we have 


N N 
ó(s, t) < 2[]G; v t) - 2 [T Gi ^ t). 
i=1 i=l 


Set a = 2 [[i (s; v t) and b = 2[ [7i (s; ^ t). Then 2 > a > b and 
ó(s, t) < a(sy V ty) — b(sy ^ ty). 
If sy > ty the right-hand side is equal to 


asyN — bty = a(sy = ty) + ty(a a= b) 
< 2|sy S tyl + la — b|. 


Similarly, if sy < ty the right-hand side equals 


aty = bsy ES alty TE Sy) + sy(a = b) 
< 2|ty — sy| + |a — b|, 


so that 
N-1 N-1 
ó(s, t) < 2 |ty = Syl T 2 £ (s; V t) = 2 X (s; ^ Lj). 
i=1 i-1 
Continuing this process yields 
N 
(8.9.9) is, t) € 2Y. |t; — sil 
i=1 


< 2N |t — s]| 


as required. 


Lemma 8.9.2 
For any s, te I, with n = min(s;, tj) > 0, 


N 
(8.9.10) E{|X(s) - X0) = 7! Y Is; — tl. 


i= 


Proof 


In two dimensions this can be checked by writing out ó(s, t) and checking for 
the four cases s, 2 f,,82 2 t2. In higher dimensions (8.9.6) and a little geometry 
suffice. 
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Lemma 8.9.1 seems to say that the incremental variance of W grows no faster 
than that of an index-} field, so that it should satisfy the same sort of stochastic 
Hólder condition. That this is in fact the case can be seen from the following 
analogue of Theorem 8.3.2. (More detailed information on the local growth 
of the sample functions of W is given in Orey and Pruitt, 1973.) 


Theorem 8.9.1 


The Brownian sheet satisfies a stochastic Hélder condition on I, of order f for 
1 
every B « 4. 


Proof 


Asin the proof of Theorem 8.3.2 wecan apply the results of Chapter 3 to bound 
the modulus of continuity, c, (h), of W by 


Ap(h) + CL - log wpb + 3C [ (log w)?u-'ptu) du 
where C is a constant, A an almost surely finite random variable, and 
plu) = max{5"(s, t), |s — t| < u/N}. 
Lemma 8.9.1 implies p(u) x (2N??u)!? for all u. Substituting this into the above 


bound for @,(h) proves the result. 


We can now look at some dimension theorems, as follows. 


Theorem 8.9.2 


The dimensions of the image and graph of W(t), t € I, are, with probability one, 
land N + 3, respectively. 


Proof 


We look at the graph only. The image is easy and not very interesting. 
According to the previous theorem and Lemma 8.2.1, the dimension of the graph 
is at most N + 4. Let I,,4 > 0, denote the set (te I,: ti > 9i = 1,..., nj. Then, 
according to Lemma 8.9.2, 


N 
(8.9.11) (s t) > 4% E ls-t| for alls, tez. 
i= 


Furthermore, since /, c 7, implies 


dimi(t, Wa) te dy) + dinit Wa) te 4i 


nt 


248 SAMPLE FUNCTION ERRATICISM AND HAUSDORFF DIMENSION 
we need only show that the right-hand dimension is at least N + 3 for some > 0. 
To prove this we need only prove, using Lemma 8.24, that 


(8.9.12) Í Í E([IW(t) — W(S!? + |t + sl?]7*?) dt ds < œ 


for any 1 <a < N + 5. The expectation here is simply 


Tis JE ELTE le = SPI"? exp - rol * 


which transforms to 


1 = . 
—| [u^3(s, t) + It — s217 %2e7 2 du 
fon -œ 


2 æ me iit = si? -aj2 
< Jin f [à(s, t)] afi + es] du. 


Once again (as in the proof of Theorem 8.4.1) we apply the fact that, if y 1, 

JE O? + a)? dy = C(y)a^?* '? to find that the above expression is bounded 

by a constant multiple of 

|t — sl 
ó(s, t) 


Substituting this back into (8.9.12) yields that the integral there is bounded by a 
constant multiple of 


[ô(s, e| "|o = [é(s, 9] !2|t — s|7** !. 


Í ó(s, ^ !?|t — s| 77+! dt ds 
I, 


n” In 
N -1/2 
D Í (X a ul It — s||~*t! dt ds 
1,\i=1 


on applying (8.9.11). However, since |t — s| < $7. , |t; — s;| for any t, s € A”, 
this last expression is bounded by 


gom lt zm s| —a+1-1/2 dt ds 
Io To 


and this expression is finite whenever (Ca + 1 — 1) + (N — 1) > - 1, ie. 
a < N + 5 (seen by transforming to polar coordinates). But this establishes 
(8.9.12), and so the theorem. 


The above two theorems indicate that although the Brownian sheet is not an 
index-3 field, it behaves, locally, very much like such a process. Furthermore, it is 
possible to prove this without much difficulty. This is not the case, however, 
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when we come to studying the zero sets or local time of W. For example, it can 
be shown that W is not LND, so that the construction of a local time described 
in Theorem 8.8.2 for LND processes does not work for W. (Although a con- 
struction of this type was attempted in Tran, 1976b, it contains flaws, as noted in 
Pruitt, 1978.) To construct a local time for W, and investigate its properties, we 
need to proceed in a rather indirect fashion, due to Cairoli and Walsh (1975) 
and Walsh (1978) for N — 2, and Davydov (1978) for general N. 

To describe this construction we set, for each te 2Y , V, to be the (N — 1)- 
dimensional interval [ [? 7; [0, :;], while Ws) denotes the one-dimensional 
process defined by 


(8.9.13) Ws) = W(t,,...,tw—1 5) s20. 
Then Ws) is simply a zero-mean, Gaussian process on [0, oo) with the co- 
variance function 


N- 


E(W(GW(G)) = ( D (S ^s) 


i= 


That is, except for a scale factor, each Ws) is a one-dimensional Brownian 
motion. Thus each W, has a local time, (u, s) say. We then define a local time 
candidate p(u, t) for W itself by setting 


(89.14) uu, t) = Í Iu, ty) dt ++ dty 
A 


so that u is an integral of lower-dimensional local times. (Note that it is not 
trivial to justify the existence of this integral, and we refer the interested reader 
to the aforementioned references for details.) Integrating (89.14) over ue B c 2’, 


we obtain 
[ io. t du = Í li Iu, ty) au dt; :::dty., 
B Vi LB 


_ Í | f roo as| dt, «diy, 


since 44 is a local time for W,. But this is simply 


f m f “IKWO dt 


which is the oceupation mcasure v(B, t). Thus 44 as defined by (8.9.14), is in 
fact a local timc. 

To obtain information about the smoothness properties of ji it is necessary 
to derive expressions for its moments. Given the iterative nature of (8.9.14) this 
is not particularly difficult, since each W, is an indes 1 process and so Theorem 
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8.8.2 automatically yields bounds on the moments of (u, s) for each t, u, and s. 
For example, 
j 


k 
« I P f | E{ | yw (0, i 
i21 


on applying Lemma 1.3.5. The integrals are over k copies of V,, and we have 
neglected the k(N — 1) differentials di? --- di? ,. The expectation here can 
be bounded by Theorem 8.8.2 and the resulting integral computed. With the 
aid of such computations it is possible to establish the following result, a detailed 
proof of which relies on some results of Yadrenko (1971a) and is given in Adler 
(1980). 


E{| 20, 0| = el f UO, ty) dt, +--+ dty 


Theorem 8.9.3 


The Brownian sheet on I, = [e, 1]" has a jointly continuous local time that 
satisfies, with probability one, a Hélder condition of order x in the set variable for 
every a < 1 — (QN) £. 


It is also possible to prove a very special local time result for both the Brownian 
sheet and the isotropic Brownian motion, as follows. 


Lemma 8.9.3 


The local times u(u, t) of both the Brownian sheet and isotropic Brownian motion 
are, without probability one, strictly positive for every u belonging to the interior 
of the closure of their images. 


A proof of this lemma for W is given by Adler (1978c). The proof for B is the 
same. Both rely on the fact, proven by Ray (1963), that the theorem holds for 
N = 1, and use this and the iterative formulation (8.9.14) to prove the general 
result. Since this result gives us that these local times have no zeros in Im W or 
Im B, we can apply the above results to Theorems 8.7.3 and 8.7.4 in a straight- 
forward fashion to obtain the following theorem. 


Theorem 8.9.4 
Let £ > O and I, = [e, 1]". Then, if LAX) = {t € 1: X(t) = u}, 
1 = P{dim L,(W) = N — 4 for all ue Interior (W(/,))} 
= P{dim LAB) = N — } for all u € Interior (B(I,))]. 
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With this result we have obtained, for the multiparameter Brownian motions, 
results analogous to all those we established in the previous section for index-fj 
processes. However, the astute reader may have noticed that whereas the 
previous section dealt with #*-valued fields we have limited our attention here to 
real-valued Brownian motions. It is not hard to define generalized versions of 
these fields, by letting W(t) = (W,(0,..., W,(0) and B(t) = (B,(t)..., Ba(t)) 
be the #%-valued versions of W and B obtained by taking the W, and B; to be 
independent Brownian sheets and isotropic fields, respectively. It is also not 
hard to show that corresponding versions of Theorems 8.9.1 and 8.9.2 exist for 
these processes. 

For example, since B is simply an (N, d), index-(4, . . . , 4) Gaussian field, it is 
covered by the results of the preceding section. As far as W is concerned, Tran 
(1977a) has shown that 


dim(Im W) = min(d, 2N), 


with probability one, while Tran (1977b) then proved that 
: ; d 
dim(Gr W) = min| 2N, N + > as. 


Both of these results follow in a straightforward fashion from an application of 
the results of this chapter. 

However, when we turn to the task of determining the existence of a local time 
for W, serious difficulties arise. As we have already noted, the construction used 
in Theorem 8.8.2 for an LND field does not work for W, which is not LND. 
Furthermore, we cannot employ an iterative construction as in (8.9.14) since 
the one-dimensional local times u, will not exist if d > 1. Since at the time of writ- 
ing the existence of a local time for W when d > 1 has not been established, we 
shall have no more to say about either the local time or dimensional properties 
of this field. 

We shall conclude our brief study of W and B by looking briefly at one result 
which indicates the existence of even more bizarre consequences of their sample 
function erraticism than those we have already investigated. 

The simulation depicted in Figure 8.0.2 shows the level sets of the two- 
parameter Brownian sheet as simple (contour) lines in the plane. We now know 
that these lines are not simple at all, but extremely erratic, with a Hausdorff 
dimension of 14. A far more unexpected result has recently been proven in 
Kendall (1980), where it is shown that these ‘lines’ are not only lines, but in- 
elude uncountable unions of disjoint points. To formalize this, let W and B be 
defined on the N-dimensional cube 7, and let C(t) and Cat) denote, respec- 
tively, the connected component of the level sets isi 4, Wi Wei; and 
sc Fi BOS) BOL which contan t Then Kendall has proven the following 
remarkable results 
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Theorem 8.9.5 
(a) For each t € (0, 1], 
P{Cy(t) = t) = 1. 
(b) For each t  [0, 1]", 
P{C;(t) = tj = 1. 
(c) LetN =2and Í, = (te I:t, = Oor t, = 0}. Then 
P{C,(0) = Î} = 1. 
Thus the contours of W and B at any fixed point are, with probability one, 


always trivial, insofar as they consist of that point only. Given this theorem 
it is easy to obtain the following fascinating result, with which we conclude. 


Corollary 


The union of the non-trivial contours of either W or B has Lebesgue N-di- 
mensional measure zero with probability one. 


Proof 


The function t > P(Cy(t) z t} is zero everywhere in I,. Therefore 


E il Iicwo+y dt = 0, 
Io 


proving the corollary for W. The same proof works for B. 


Appendix. The Markov Property for Gaussian 
Fields 


Most of this book has been concerned with aspects of the sample function 
behaviour of Gaussian fields, particularly when these give rise to interesting 
geometrical problems. In general we have stayed away from purely probabilistic 
or structural-type problems, except when it became necessary to study these to 
facilitate the investigation of primary concern, as was, for example, the case 
with the spectral theory developed in Chapter 2 and Section 7.2. We now wish to 
look at one structural problem which is of deep intrinsic interest, despite the 
fact that it seems to have little to do, even indirectly, with sample function 
analysis. This is the problem of the possession, or otherwise, of a Markovian- 
type property by Gaussian fields. Our aim is to briefly cover the existent theory 
of Gauss-Markov fields. We shall not attempt to give proofs of any of the 
results of this theory. Such proofs generally rely on fairly involved complex 
analysis which is likely to be both unenlightening and of little interest to the non- 
specialist. 

Throughout this survey we shall have need to make repeated use of the 
c field generated by the values of a field X(t) as its index t runs over some set A. 
We write this as o(X(t), t e A). These c fields can be understood on two levels. 
On the one hand, they have an exact measure-theoretic interpretation. On the 
other hand, however, they can be regarded simply as the ‘information’ an 
observer can gather about the field X(t) if he knows all of its values for t € A. 
Although an understanding of the first type is a necessary prerequisite for follow- 
ing and producing proofs of the results presented below, an understanding of the 
second type is sufficient to appreciate their main content. Thus, we trust, this 
appendix should be relatively easy to follow, although the theory it presents, is, 
in its full form, mathematically demanding. 

As there is a great deal of difference between the Markovian structure of 
fields on .4*, N > lI. and processes on the real line, we start by initially re- 
stricting our discussion to the latter situation, where we have the following 
definition. 

T 


254 APPENDIX. THE MARKOV PROPERTY FOR GAUSSIAN FIELDS 
Definition A.1 


A real-valued stochastic process X(t), t € E c &', is called Markovian if, 
for every Borel set A of R’ and any t > t, >- > tin E, 


(A.1) P{X(t) e A|X(t,),..., Xt) = PIXO e A| X(t1)}. 


This definition has an equivalent formulation in terms of the c fields generated 
by the process. Specifically, we have the following definition. 


Definition A.2 
A real-valued stochastic process is Markovian if and only if for any 
A € o(X(s), s x t), Be o(X(s), s > t) 
and for each t € E, 
(A.2) P(AB|X(t)) = P{A| X()}P{B| XO}. 


If we interpret o(X(s), s € E*) as the information the process generates as s 
varies over the points of E* then what this definition says is that given precise 
information on the present behaviour of the process (i.e. the value of X(t)) the 
past and future behaviour are conditionally independent. The proof of the 
equivalence of these two definitions can be found in any standard text on Markov 
processes, e.g. Doob (1953). 

There is a wide variety of Gaussian processes on 4! which are Markovian. 
The simplest of these is the Brownian motion, W(t), which we have already met 
in other circumstances. An extensive class of Markov-Gaussian processes is 
provided by the so-called diffusion processes X(t) which arise as solutions of the 
stochastic differential equation 


(A.3) dX(t) = m(x, t) dt + W(O)o(x, t) dt], 


which holds conditionally on X(t*) = x for some t*. Here m and c are arbitrary, 
smooth, functions. For more information on these processes see, for example, 
Doob (1953). 

Both the Brownian motion and the Gaussian diffusion processes are non- 
stationary, and when we limit ourselves to stationary Gaussian process which 
are Markovian it turns out, rather surprisingly, that up to a scale factor there 
exists only one such process. This is the so-called Ornstein- U hlenbeck process, 
which has zero mean and an exponentially decaying covariance function R(t) — 
e 77, t > 0,a > 0.(See, for example, Feller, 1971, for a discussion of this process.) 
We can enlarge this rather small class of examples by slightly relaxing the 
definition of the Markov property, as in the following definition, 
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Definition A.3 


A real-valued stochastic process t € E € 4?! is called quasi-Markovian if, 
for any A € o(X(s), s < t, s > u), Beo(X(s), t < s < u), and for each pair t, u 
with t « u in E, 


(A.4) P{AB| X(t), X(u)) = P{A| XO, X(W) P{B| XC), X(u)). 


The importance of this concept lies in the fact that it avoids the necessity 
of using the linear ordering of 4?! that is so crucial in defining the ordinary 
Markov property. Since higher dimensional spaces do not exhibit this type of 
ordering it is not possible to talk about past and future being independent given 
the present, but one can talk about the interior of some set being independent 
of the exterior, given the boundary which, say, could be a simply connected 
closed smooth curve. This, in essence, is the type of property considered in (A.4). 
Chay (1972) has characterized all stationary, quasi- Markov, Gaussian processes 
on the real line as follows (see also Jamison, 1970). 


Theorem A.1 


Let X(t), t € [0, T] be a stationary Gaussian process with the zero-mean, unit 
variance, and continuous covariance function R(t). Then X is quasi-Markov if 
and only if R(t) has one of the following three forms: 


(A.5) R(t) = Ae" * + (1 — Aye", a>0,A > 3, 


where 


1 A 
T<— e(z 4.) for A > 1, 


2a A-1 
and 
1 A ; 
EPI for 1 >A>z, 
(A.6) R(t) = cos at, a>0,T x n/a, 
(A.7) R(t) = 1 — at, a>0,T x 2/a. 


That these processes represent a useful generalization of the ordinary 
Markov processes is evidenced, for example, by the work of Slepian (1961), 
where the quasi-Markov nature of the process with covariance (A.7) with a = | 
is used as an aid in deriving an expression for Pi X(t) © 4.0 € t « 1}. As we 
noted in. Section 69 the exact computation ob this type of probabilities is 
generally impossible. 
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The existence of useful, stationary quasi-Markov processes suggests that we 
could perhaps define a similar notion for random fields. Indeed, if we let 0D 
be an infinitely differentiable closed surface ((N — 1)-dimensional manifold) 
separating Z" into a bounded part D^ and an unbounded part D*, we say that a 
random field X(t), te Z", is quasi-Markovian if for any such ôD and any 
Aeoc(X(t,teD'), Beo(X(t, te D*), A and B are independent given o(X(t), 
t € OD). This seems a reasonable extension of the definition in one dimension, 
but rather surprisingly, leads to the following result. 


Theorem A.2 


Let X(t), t € R”, be a homogeneous Gaussian field, with continuous covariance 
function and quasi-Markovian in the above sense. Then: 


(a) X is completely determined by its values on any infinitely differentiable 
closed surface OD in the sense that, with probability one, 


(A.8) X(t) = E{X(t)| X(s), s € ôD}. 


(b) If X is also isotropic, X is degenerate in the sense that X(t) = X(0) for 
all t, where X(0) is a Gaussian variate. 


This rather remarkable result was proven by Wong(1969). (See also Yadrenko, 
1959.) Wong (1971) has an easily readable proof of Theorem A.2(b). In essence 
what this result states is that no simple extension of the one-dimensional 
Markov concept is likely to prove useful in higher dimensions. However, it is 
possible to develop a Markov-type structure for fields if, instead of conditioning 
on the values that the field takes on the boundary OD, we condition on a some- 
what larger set. To see how we might do this, let us look at a problem Lévy set 
(Lévy, 1956, p. 136) which started most of the work in Markov fields. 

Lévy’s problem was the following. We know that Brownian motion on 2?! 
is Markovian. Consider B(t), the isotropic Brownian motion in Z^, i.e. the 
zero-mean Gaussian field with covariance function 4{||t|| + Isi — It — sll} 
Lévy realized that B(t) could not be quasi-Markovian but he asked if, given a 
surface @D as before and given some information about the behaviour of Bin a 
neighbourhood of 0D, would events in o(X(t), t € D^) and o(X(t), te D^) be 
independent? In particular Lévy conjectured that for odd-dimensional spaces 
B(t), t e 4?* *!, was Markovian of order K, in the following sense. A random 
field is said to be Markovian of order <(K + 1) if, for any ôD as above, every 
sample function approximation X*(t) to X(t) in a neighbourhood of ôD which 
has the property 


lim d^ *|X(t) — X*(0| = 0, 


dl 0 
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where d = d(t) = inf, -ap [s — t|, also has the property that given X*, X(t) and 
X(s) are independent whenever t € D^ and s e D*. If X(t) is Markov of order 
<(K + 1) but not <K, then it is said to be Markov of order (K + 1). 

Although Lévy's conjecture has never been proven in quite its original 
formulation, McKean (1963) proved a related result in which he introduced a 
new Markovian notion for fields. To state McKean's result, which was the 
forerunner of the now substantial theory on Markov fields, we require some 
notation. In essence, we need to determine c fields that, given a particular ôD, 
correspond to the past, present, and future of X(t), as well as fields yielding 
information about the ‘immediate’ past and future. Thus, for each ôD as above, 
we set 


The ‘past? = ^ = o(X(t, te D^) 
The ‘future’ = X* = o( X(t), te D*) 
The ‘present’ = eX = ^ {o(X(t), te O) 
The ‘immediate past’? = 0X” = N {o(X(t), te Om D-)} 
The ‘immediate future’ = 0E* = ^ {a(X(t), te On D*)} 
where in each of the last three definitions the intersection is taken over all open 
sets O which contain ôD. 

It is worthwhile to note that calling 0X ‘the present’ of the process is actually 
somewhat misleading, for this o field contains more information than that 
generated by knowledge of the values of X on ôD. For example, if X possesses 
derivatives, then whereas o( X(t), t € 0D) contains no information about these 
derivatives, even for t € ôD, the larger o field 0X generally does. 

We need one more concept before we can define the Markov property we 


require. We say ac field X, c X^ is a splitting field for OD if, conditional on 2, 
past and future are independent, i.e. if for each A € X^, B e X*, 


(A.9) P(AB|X,j = P(AIZJP(B|X,. 


It is easy to see that X^ itself is a splitting field for OD, and it is straightforward 
to check that the intersection of two splitting fields is again a splitting field. 
Hence there exists a minimal splitting field, obtained by taking the intersection 
of all splitting fields. We can now introduce the following definition. 


Definition A.4 


A real-valued random field X(t), t € A~, is called pseudo-Markovian if for each 
infinitely differentiable closed surface 0D c 9f" we have 


(A.10) CL! = 0E =A 
and 


(A.11) OX is the minimal splitting ficld for cD 
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Condition (A.10) is of a technical nature and says we see the same things in the 
very near future as in the very recent past. Thus 9X c X* n X^ and the defini- 
tion requires that the splitting field of ôD is as small as possible and involves 
little more than boundary data. 

Although the analogy with one dimension is clear, there are many processes 
on 2?! which are pseudo-Markovian but not Markovian in the usual sense. An 
example is readily provided by the integrated Brownian motion defined by 


X(t) — [20 dt. 
0 


This is pseudo-Markovian, but not Markovian. However, the vector-valued 
process (X(t), B(t)) is Markovian in the usual sense. Indeed, there are many 
examples of this type; i.e. if the bivariate process (X(t), dX(t)/dt) is Markovian, 
then X is pseudo-Markovian. 

With essentially this formulation, McKean (1963) showed that the isotropic 
Brownian motion was pseudo-Markovian in odd dimensions, but not in even 
dimensions where no Markov-type property at all holds. He actually went 
somewhat further than this, for he succeeded in identifying the minimal splitting 
field 9X. That is, he determined what information one needed about B in the 
vicinity of ôD to make events in X^ and X* conditionally independent. In 
essence, in the (2K + 1)-dimensional case, 0X turned out to be the information 
generated by B and its ‘normal derivatives’ ôB, i = 1,..., K, on 6D. However, 
since B is not even once differentiable (cf. Section 8.9) these derivatives need to 
be carefully defined, which we shall now do, following the formulation of Pitt 
(1971). 

Let do be the surface measure on ôD and let t be the unit normal vector to 
OD at the point t € ôD. For each function f (t) which vanishes outside ofa compact 
set in 0D and for which Im f(t)|? dt < oo, we introduce the function 


(A.12) F(h) -f f(OB(t + ht) do(t) 
ôD 


of the real variable h. Then if ôD has dimension (N — 1) it is not hard to show 
that F is (N — 1) times continuously differentiable. We write its derivatives as 
FO), F... It is these derivatives that correspond to the normal derivatives of 
B. We now introduce the ‘differential’ o field 


(A.13) Ly = c(F(0):0 < k < K, f as above}, 


for K = L..., N — 1. McKean showed that in the (2K + 1)-dimensional case 
Xy was the minimal splitting field for ôD. 

Pitt (1971) has carried this idea over to Gaussian fields other than the 
isotropic Brownian motion, and introduced the following definition. 
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Definition A.5 


Let X(t), t € B® be a random field and 0D as above. Suppose F(h) as defined by 
(A.12) with X replacing B has p continuous derivatives at h = 0. Then if the 
differential field X, defined by (A.13) is the minimal splitting field of 0D, we say 
that X is pseudo-Markovian of order p. 


Since it is straightforward to see that Z,, when it exists, is contained in the 
intersection of 0X^ and dX*, and, furthermore, contains 6X, it follows that if a 
field is pseudo-Markovian of any order and (A.10) is in force then the field is 
also pseudo-Markovian in the sense of Definition A.4. 

For homogeneous Gaussian fields it is possible to determine the type of 
Markov condition the field satisfies from its spectrum. To see how this works 
let X be such a field, and suppose it has spectral density f(A), X € 4". Then a 
result of Pitt (1971) yields the following theorem. 


Theorem A.3 


A homogeneous Gaussian field is pseudo-Markovian of order p with respect to 
bounded surfaces CD if and only if the inverse of its spectral density can be ex- 
pressed as a polynomial 


(A.14) Y au aA ARN, 
k 
with |k| = k, + --- + ky < 2p, such that there exists a constant C > 0 with 
(A.15) CIA € | Y a V AME AY 
Ik] 7 2p 


As an example of this theorem, let us consider an N-dimensional analogue of 
the Ornstein-Uhlenbeck process on 4?!; viz. let X(t) be the homogeneous 
Gaussian random field on Æ” with zero mean and the covariance function 


(A.16) R(t) = exp(—a,t, — --- — ayty). 


Then it is easy to check that the spectral density of this field is of the form 
N 
fO)= CT] + A^, 
ict 


with C a constant depending only on the x. The inverse of f is clearly of the 
form (A.14) with [k| < 2N. Thus the Gaussian field with covariance (A.16) 
is pseudo-Markovian of order N. Note that; unlike the case with the isotropic 
Brownian motion, this property holds regardless of whether N is odd or even 
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As a second example, it is easy to check that the homogeneous Gaussian field 
with zero mean and the covariance function 


N 
(A.17) R(t) = [] max[0, 1 — Itl] 
i=1 
has the spectral density 
N 
SA = [I4 
i-1 


so that it, too, is pseudo-Markovian of order N. In one dimension the covariance 
function (A.17) is the triangular covariance of (A.7) and the corresponding 
process is quasi-Markovian. In general (A. p yields the 1-triangular covariance 
field of Section 6.9. 

If we do not wish to specify a particular order of the pseudo-Markovian 
property, it is possible to obtain weaker conditions on the spectrum than (A.14) 
and (A.15). For example, suppose the inverse of the spectral density is locally 
integrable, i.e. 


(A.18) f~'Q) di «oo for every compact A c A”, 
A 


and that f (à) does not decay too quickly as ||A|| > oo. Specifically, suppose there 
exists a C >O and a non-negative, non- decreasing function T(x), x > C, 
such that 


(A.19) SQ) > exp[ — T(I.I)] for all sufficiently large ||% || 


and 
(A.20) Í T(x)x^? dx < œ. 
C 


Then Kotani (1973) has proven the following theorem. 


Theorem A.4 


A homogeneous Gaussian field whose spectral density satisfies (A.18) to (A.20) 
is pseudo-Markovian if and only if f ^ ! (X) agrees, for almost every X, with the 
restriction to Z” of an entire function of minimal exponential type. 


(A function g(z) on the N-dimensional complex space C" of N-tuples of 
complex numbers is called entire if its domain is the whole of C" and it is con- 
tinuous and analytic in each variable separately. Furthermore, it is said to be of 
minimal exponential type if for each ¢ > 0 there exist a finite M > 0 for which 


Ig(2)| € M exp(e|z|) for all z e CN. 
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Such functions always possess everywhere convergent power series expansions 
of the form 


g(z) = » 0, ky Ze e ZN.) 
k 


It should be clear from this result that as we progress into a more detailed 
study of Markovian properties for fields even the statements of the theorems 
require a knowledge of complex analysis to be fully appreciated. Thus we shall 
cease our survey at this point, and conclude by indicating where the interested 
reader should turn for further information. 

In a series of papers Pitt (1971, 1973, 1975a, 1975b, 1975c) has presented a 
detailed development of the above theory, including a more general version 
of Theorem A.4 and studies of yet another type (L-Markovian) of multiparam- 
eter Markov structure. To define this notion let L be a compact subset of 2% 
that includes the origin. Setting D7 + L = {t + s, te D',se L}, we define the 
L boundary of D^ to be the set: 


oQ,D-(D + L)n D*. 


Then X is said to be L-Markovian if, essentially, values of X in D^ and 
(D^ o ô DY are conditionally independent given the values of X in 0; D. A 
field that is L-Markovian for every such L, no matter how small, is clearly 
pseudo-Markovian. We refer the interested reader to Pitt's papers for further 
information about this property, including analogues of Theorem A.4 for the 
L-Markov property. 

Other related papers, presenting results in the same spirit as the ones we have 
considered, are Cartier (1971), Kontani and Okabe (1973), Moléan (1967, 1971), 
Okabe (1973a, 1973b), and Yadrenko (1971b). There is also a large number of 
papers devoted to Markovian properties for the so-called 'generalized pro- 
cesses’. Examples of these are Kallianpur and Mandrekar (1974), Piterbarg 
(1976), Rozanov (1967, 1977, 1979), and Urbanik (1962). A very early paper by 
Whittle (1963) discusses the generation of Markov-type fields as solutions to 
stochastic partial differential equations on the plane, and still represents 
interesting reading. 
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